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1 Introduction 



This lecture will elaborate on what a quantization is. If classical phase space, i. e. the 
space of possible positions and momenta, is r*R^ = x R^, we give a concrete solution. 
Roughly speaking, we generalize Dirac's prescription to replace x by "multiplication with 

X," 

{x^){x) := X (/?(x), if G L^CR^), 

and momentum by a "derivative with respect to x" 

(p(^)(x) := -ihV^ifM, if e L^(R^) n "^^(R^). 

Dirac's recipe only works if we are quantizing functions of x or p and their linear combi- 
nations, e. g. H(x,p) = -^p^ + y(x) becomes H = H(x,p) = — :^A^ + y(x), and it needs 
to be supplemented with an "operator ordering prescription:" what is the quantization of 
X -p? Possible solutions are 

x-p, p-x = x-p-i/2idi2, |(x-p+p-x), 

all of which are different operators and the first two are not even symmetric (= hermitian) . 
Choosing an operator ordering is necessary, because x and p do not commute, 

The physical constant h has a fixed value and units and it measures the "degree of non- 
commutativity" of position and momentum. On the other hand, can we find the function 
whose quantization is, say, x - p = ^jLi Pi^ One way to look at it is to find a product % 
which emulates the operator product on the level of functions, i. e. 

1=1 1=1 

On the one hand, this product must inherit the non-commutativity of the operator product, 
but on the other, if "h is small," x^tlPz should be approximately given by x^p^. One of 
the major reasons to study % is that we can use it to derive corrections in perturbation 
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expansions: if we can find an expansion of the product tJ/j in h, we can "expand the operator 
product:" 

00 

f g=fkg = ^^"(/fc^)(n) + small error 

n=0 

We will make all of the above mathematically rigorous in this lecture and explain the 
symbols properly This idea was first proposed by Littlejohn and Weigert [LW93 ] (who 
are theoretical physicists) and used both in a mathematically rigorous fashion and in the- 
oretical physics to derive currents in crystals that are subjected to electromagnetic fields 
IIPST03aJ , piezocurrents [IPST061 lLei05 ] , guiding center motion of particles in electro- 
magnetic fields ||Mu99ll , the non- and semirelativistic limit of the Dirac equation IIFL08II 
and many others. 

The second main point of this lecture is the semiclassical limit ("/l 0") : for h = 0, p 
and X commute. However, a priori it is not at all clear how and why this implies classical 
behavior. Let H(x,p) = + be a classical hamiltonian. If we start at t = at the 
point (xq.Po) m phase space, Hamilton's equations of motion 



p = -v,H = -v,y 

with initial conditions (xq.Po) determine a unique classical trajectory (x(t),p(t)). The 
quantum dynamics of a particle whose initial state is described by the wave function iJjq 
is given by the Schrddinger equation, 

ih—^P\t) = H^p\t)=i-^i-ihV,f + Vix))^p\t), ^p\o) = ^Po^L\R''). (1.1) 

If in order to "take the semiclassical h 0/' we naively set h = in the Schrodinger 
equation, we get the unintelligible equation 

o = y(x)v^°(t) 

which implies trivial dynamics. The reason is that the actual solution requires a bit more 
thought. There are simple ansatze such as WKB functions, coherent states and other 
wave packet approaches^, but this suggests that only special ("good") initial states have 
a good semiclassical limit. First of all, why should nature "choose" such rather special 
initial conditions? Wave packet approaches are usually based on the Ehrenfest theorem 

wave packet is a sharply peaked wave function whose envelope function varies slowly, just like in the case 
of a WKB function. The center of the wave packet is interpreted as "semiclassical position" and the time- 
derivative of this center gives the classical velocity. 
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which roughly states that "quantum expectation values behave like classical observables." 
Although this is always correct, it is not always interesting: consider the case of two wave 
packets on the line M that are identical in every respect, but run in opposite directions. 
The average position is always 0, but the dynamics is far from trivial. 

We will show that such special initial states are not necessary for semiclassical behavior, 
it is a generic phenomenon. The core is the Egorov theorem: 

Theorem 1.0.1 (Egorov) Let H{x,p) = + V(x) be a hamiltonian such that V G 

^°°(M^) and \d^Vix)\ < C^^/l-\-x^ for a e N^. Furthermore, let f G ^^°°(R^ x 

Rp be an observable. Then for h<il, we have 

e<^fe-'i^ -J7^, = m) 

where f denotes the Weyl quantization of f and the classical flow generated by H. 

The second goal of this lecture is to understand the Egorov theorem. Once all the symbols 
have been explained, the result is rather intuitive: it says that for "good" observables and 
"good" hamiltonians, the Heisenberg observable F^^(t) := e'^^^^fe~^^^ can be approxi- 
mated for macroscopic times by F^i : = / o One can show that this is the case for typical 
hamiltonians whose kinetic and potential energy are smooth and grow at most quadrat- 
ically. This is by no means the Egorov theorem in its most general form; in particular, it 
does not depend on the form of the hamiltonian. 2009.10.20 
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2 Physical Frameworks 



Understanding of quantization requires knowledge of classical and quantum mechanics. A 
quantization procedure is not merely a method to "consistently assign operators on L^(R^) 
to functions on phase space/' but rather a collection of procedures. A nice overview of the 
two frameworks can be found in the first few sections of chapter 5 in [ WalOSH and we will 
give a condensed account here: physical theories consist roughly of three parts: 

(i) A state space: states describe the current configuration of the system and need to be 
encoded in a mathematical structure. 

(ii) Observables: they represent quantities physicists would like to measure. Related to 
this is the idea of spectrum as the set of possible outcomes of measurements as well 
as expectation values (if ones deals with distributions of states) . 

(iii) An evolution equation: usually one is interested in the time evolution of states as 
well as observables. As energy is the observable conjugate to time, energy functions 
generate time evolution. 

2.1 Hamiltonian framework of classical mechanics 

For simplicity we only treat classical mechanics of a spinless point particle moving in M.^, 
for the more general theory we refer to [ MR99, Arn89 ] . In any standard lecture on 
classical mechanics, at least two points of views are covered: lagrangian and hamiltonian 
mechanics. 

We will only treat hamiltonian mechanics here: the dynamics is generated by the so- 
called hamilton function H : x — > R which describes the energy of the system 
for a given configuration. Here, R^ x R^ is also known as phase space. Since only energy 
dijferences are measurable, the hamiltonian := H -\- Eq, Eq g R, generates the same 
dynamics as H. This is obvious from the hamiltonian equations of motion, 

x(t) = +V^H(x(t),p(t)) (2.1) 
p(t) = -V,H(x(t),p(t)), 
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which can be rewritten in matrix notation as 

The matrix J appearing on the left-hand side is often called symplectic form and leads 
to a geometric point of view of classical mechanics. For fixed initial condition (xo,Po) ^ 
X at time = 0? i- initial position and momentum, the hamiltonian flow 

^ : Rf X R^ X R^ — ^ R^ X R^ (2.3) 

maps (xo,Po) o^^^o the trajectory which solves the hamiltonian equations of motion, 

^t(xo,Po) = (x(0),p(0)) = (xo,Po). 

If the flow exists for all t g , it has the following nice properties: for all t, G R^ and 
(xo,Po) ^ X we have 

(i) ^t(^t/(xo,Po)) =^t+t'(^o.Po). 

(ii) ^oixo,po) = ixo,po),and 

(iii) <l>t(<^>-t(xo,Po)) = ^t-t(^o.Po) = (^o.Po)- 

Mathematically, this means ^ is a group action of R^ (with respect to time translations) on 
phase space R^ x R^. This is a fancy way of saying: 

(i) If we first evolve for time t and then for time t\ this is the same as evolving for time 
t + t\ 

(ii) If we do not evolve at all in time, nothing changes. 

(iii) The system can be evolved forwards or backwards in time. 

The existence of the flow is usually proven via the Theorem of Picard and Lindelof. It 
holds in much broader generality and is no easier to prove if we specialize to X = (x,p) 
and r2^ =R^ xRf. 

Theorem 2.1.1 (Picard-Lindelof) Let F be a continuous vector fleld, F e ^(Lr,R"), U c 
R" open, which deflnes a system of differential equations, 

X = F(X). (2.4) 

Assume for a certain initial condition Xq there exists a ball Sp(Xo) := {X G R" | |X - 
Xq\ < p} c. u, p > 0, such that F is Lipschitz on Bp(Xq\ L e. there exists L > which 
satisfies 

|f(x)-f(xO| <l|x-x'| 
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for allX,X^ G Bp(Xo). Then the initial value problem, equation ( |2.4| ) withX(0) =Xq, has a 
unique solution t ^ X(t)for times \t\ < T := mm(^p/v^^^,'^/2L) where the maximal velocity 
is defined as V^^^ := supxeB^^xJPiX)\. 

Multiplying the hamiltonian equations of motion ( |2.2D by the inverse of the symplectic 
form, we get the vector field F explicitly in terms of the gradients of H with respect to x 
and p. 

Proof We only sketch the proof here since it is part of the standard course on analysis 
(for physicists) or differential equations (for mathematicians). It consists of three steps: 
We can rewrite the initial value problem equation ( |2.4| ) withX(O) =Xq as 

Xit)=Xo+ f dsF(X(s)) 
Jo 

This equation can be solved iteratively: we define X^^t) := Xq and the n + 1th iteration 
X„+i(t) := (P(X„))(t) in terms of the so-called Picard map 

(P(Xj)(t):=Xo+ f dsFiX.it)). 
Jo 

If t G [—T,-\-T] and T > is chosen to be small enough, P : 3^ — > ^ is a contraction on 
the space of trajectories which start at Xq, 

^ jy e ^([-r,+r],Bp(Xo)) | Y(0)=Xo}. 
X is a complete metric space if we use 

d(7,Z):= sup |7(t)-Z(t)| Y,Ze^ 

te[-T,+T] 

to measure distances between trajectories. A map P : 3^ — > ^ is a contraction if for all 
Y, Z there exists C < 1 such that 

d(P(7),P(Z)) <Cd(7,Z). 

If the Picard iteration is a contraction on X, since it is complete, the sequence (X„) con- 
verges to the unqiue solution of ( |2.4| ) X = P(X) by the Banach fix point theorem. 

To show that P is really a contraction, we need to treat both possible choices of T : the 
first, T < p/Vmax implies that the trajectory cannot leave the ball Bp(Xo). For any Y ^ X, 
we have 

\P(Y)-Xo 



dsF(7(s)) 



<tV^,,<TV^,,<p. 
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The second condition, T < ^/ii, together with the Lipschitz property ensure that it is also 
a contraction with = ^/2: for any Y,Z ^ ^ , we have 



d(P(7),P(Z))= sup 

te[-r,+r] 



ds[(F(7))(t)-(F(Z))(t)] 



<TL sup \Y{t)-Z{t)\<j-Ld{Y,Z)=\d{Y,Z) 

te[-r,+r] 

This concludes the proof. □ 

Corrolary 2.1.2 If the vector field F satisfies the Lipschitz condition globally, i. e. there exists 
L > such that 

|f(x)-f(xO| <l\x-x'\ 

holds for all XX ^ I^". then t ^ ^ti^o) exists globally for all t e and Xq e R". 

Proof For every g R", we can solve the initial value problem at least for \t\ < ^/2l. 
Since 

|f(Xo)|<|f(o)| + l|Xo| 

implies that if we choose our ball large enough, i. e. for any p > \Xq | the condition 
\t\< p/Vmax is automatically satisfied, 

p p 1 

> —. : > 



^max |F(0)|+L(|Xo|+p) 2L 

Hence, we can patch local solutions together using the group property o <|)^/ = $^_^^/ of 
the flow to obtain global solutions. □ 

Another important fact is that the flow <l> inherits the smoothness of the vector field which 
2009.10.21 generates it. 

Theorem 2.1.3 Assume the vector field F isk times continuously differentiable, F e "^^{U, R"), 
[/ c R". Then the flow ^ associated to ( |2.4| ) is also k times continuously differentiable, 
i. e. <l> G ^^([-r,+r] X V,U) where V dU is suitable. 

Proof We refer to Chapter 3, Section 7.3 in ||Arn92|| . □ 
The above results immediately apply to the hamiltonian equations of motion: 
Corrolary 2.1.4 Let H = + y(x) be the hamiltonian which generates the dynamics 



according to equation ( |2.2D such that V^V satisfies a global Lipschitz condition 

|v^y(x) - v^y(xO| <l\x- x'\ Vx,x' g r^. 

Then the hamiltonian flow $ exists for all t G R^. 
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Classical states Pure states in classical mechanics are simply points in phase space: a 
point particle's state at time t is characterized by its position x(t) and momentum pit). 
More generally one can consider distributions of initial conditions which are relevant in 
statistical mechanics, for instance. 

Definition 2.1.5 (Classical states) A classical state is a probability measure [jl on phase 
space, that is a positive Borel measure^ which is normed to 1, 

> for all Borel sets [/ c x 

/i(R^ XR^) = 1. 

Pure states are point measures, i. e. if {xq,Pq) g R^ x R^, then the associated pure state is 
given by M(xo,Po)(*) '= ^{x„p,)i') = ^i' - (^o.Po))-^ 

Observables Observables / such as position, momentum, angular momentum and en- 
ergy are smooth functions on phase space with values in R, / G S^°°(R^ x Rf,R). If we 
compose observables with the flow, we can work with time-evolved observables 

/(t):=/o$^ :R^ xR^ — >R 

where ^ is the flow generated by some suitable hamiltonian H. If (x(t),p(t)) is the tra- 
jectory associated to the initial conditions (xq,Pq), then (/(t))(xo,Po) — / (^t(^o^Po)) — 
/ (x(t),p(t)) gives the value of the observable at time t. This is analogous to the Heisen- 
berg picture in quantum mechanics: observables are evolved in time and states stay con- 
stant. The possible outcomes of measurements is given by the 

Definition 2.1.6 (Spectrum of an observable) The spectrum of a classical observables, i. e. the 
set of possible outcomes of measurements, is given by 

spec/ := /(R^ x R^) = imf. 

If we are given a classical state [jl, then the expected value of an observable / for the 
distribution of initial conditions /i is given by 

E^(/(t)) := d/i(x,p)/(c|>,(x,p)) = dM(<l>-t(x,p))/(x,p) 

^Unfortunately we do not have time to define Borel sets and Borel measures in this context. We refer the 
interested reader to chapter 1 of II LLP 111 . Essentially, a Borel measure assigns a "volume" to "nice" sets, 
i. e. Borel sets. 

^Here, 5 is the Dirac distribution which we will consider in detail in Chapter Isl 
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The right-hand side corresponds to the Schrddinger picture where states are evolved in 
time and not observables: the time-evolved state ju(t) associated to [jl is defined by /i(t) := 

Time evolution Now to the dynamics: we have already defined the time-evolved observ- 
able fit). Which equation generates its time evolution? 

Proposition 2.1.7 Let f e ^°°(R^ x ,R) be an observable and ^ the hamiltonian flow 

p 

which solves the equations of motion ( |2.2D associated to a hamiltonian H G '^°°(R^ x 
which we assume to exist globally in time for all (xq.Po) ^ x Then 

^/(t) = {H,/(t)} (2.5) 

holds where {f,g} := SjLi i^pj ^xiS ~ ^xj ^piS) ^he so-called Poisson bracket. 

Proof Theorem |2.1.3| implies the smoothness of the flow from the smoothness of the 
hamiltonian. This means /(t) G '^^(R^ x R^,M) is again a classical observable. By as- 
sumption, all initial conditions lead to trajectories that exist globally in time."^ For (xqjPq), 
we compute the time derivative of /(t) to be 

^/(t)^ (xo,Po) = ^/(^(aKt)) 

d 

= XI [^^i^ ^ti^o,Po)xiit) + dpj o <l>t(xo,Po)Pz(0) 

1=1 

d 

1=1 

+ ^pj o ^t(^o.Po) {-^xH o <l>t(Xo,Po))) 
= {H(t)J(t)}. 

In the step marked with we have inserted the hamiltonian equations of motion. Com- 
pared to equation ( |2.5D , we have H instead of H(t) as argument in the Poisson bracket. 
However, by setting f(t) = H(t) in the above equation, we see that energy is a conserved 
quantity, 

^H(t) = {H(t),H(t)} = 0. 
Hence, we can replace H(t) by H in the Poisson bracket with / and obtain equation ( |2.5D .n 

slightly more sophisticated argument shows that the Proposition holds if the hamiltonian flow exists only 
locally in time. 
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The proof immediately leads to the notion of conserved quantity: 

Definition 2.1.8 (Conserved quantity/ constant of motion) An observable f G S^°°(R^x 
R^,R) which is invariant under the flow ^ generated by the hamiltonian H G S^°°(R^ x 
R^,R), i. e. 

fit)=f (01 

or equivalently satisfies 

^/(t) = {H,/(t)} = 0, 
is called conserved quantity or constant of motion. 

As is very often in physics and mathematics, we have completed the circle: starting from 
the hamiltonian equations of motion, we have proven that the time evolution of observ- 
ables is given by the Poisson bracket. Alternatively we could have started by postulating 

^/(t) = {H,/(t)} 

for observables and we would have arrived at the hamiltonian equations of motion by 
plugging in x and p as observables. Another important fact is Liouville's Theorem which 
states that the hamiltonian flow preserves phase space volume: 

Theorem 2.1.9 (Liouville) The hamiltonian vector field is divergence free, i. e. the hamil- 
tonian flow preserves volume in phase space of bounded subsets V o/ R^ x Rf with smooth 
boundary d V. In particular, the functional determinant of the flow is constant and equal to 

dei{D^,ix,p))=l 

for all t G Rf and (x,p) G R^ x R^. 

Remark 2.1.10 We will need a fact from the theory of dynamical systems: if <l>f is the 
flow associated to a differential equation X = F(X) with F G ^^(R'^^R'^), then 

holds for the differential of the flow. As a consequence, one can prove 

^ (detD<l>,(X)) = tr (DF(<I>,(X))) det (D<I>,(X)) 
= divF(<l>,(X)) det (D^.iX)), 
and det [D^^)\^^^ = 1. We refer to l|Dre071 Chapter 1.6] for details and proofs. 
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Proof Let H be the hamiltonian which generates the flow <l>f . Let us denote the hamilto- 
nian vector field by 

X - ( 

Then a direct calculation yields 

d 

divXn =Y,[d,, i+dp,H) + 5^, (-a„H)) = 

1 = 1 

and the hamiltonian vector field is divergence free. This implies the hamiltonian flow 
preserves volumes in phase space: let V c x be a bounded region in phase space (a 
Borel subset) with smooth boundary Then for all — T < t < T for which the flow exists, 
we have 

^voi(<i>,(y)) = ^ I dxdp 



dx'dp'det (D$t(x^pO). 

Since V is bounded, we can bound det {D^^) and its time derivative uniformly Thus, can 
interchange integration and differentiation and apply Remark [2. 1.10| 



d 
dt 



— dxMpMet(D^>,(x^pO) = dxMp'--det(D<J>,(x^pO) 
at Jy at 

= 1 dx'dp'divXH(*tUV))det(D*,(x',pO) 
Jv ^ ^ ^ 

=0 

= 0. 
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_d 
dt 

tional determinant of the flow is constant as the time derivative vanishes, 



Hence — Vol(y) = and the hamiltonian flow conserves phase space volume. The func- 



^det {D^,ix\p'))=0, 

and equal to 1, 

det (D<l>f(x^pO) If^o = detid^dxM^^ = 1- 
This concludes the proof. □ 

With a different proof relying on alternating multilinear forms, the requirements on V can 
be lifted, see e. g. I|KS04[ Satz 29.3 and Satz 29.5]. 

Corrolary 2.1.11 Let [ibe a state on phase space x and the flow generated by a 

X p 

hamiltonian H G S^°°(M^ x Rp which we assume to exist for |t| < T where < T < oo is 
suitable. Then ii(t) is again a state. 

Proof Since is continuous, it is also measurable. Thus ju(t) = [lo c^_^ is also a Borel 
measure on R^ x R^ (<|)_f exists by assumption on t). In fact, is a diffeomorphism on 
phase space. Liouville's theorem not only ensures that the measure jtx(t) remains positive, 
but also that it is normalized to 1 : let [/ c x R^ be a Borel subset. Then we conclude 



(M(t))([/)= d(M(t))(x,p)= dM(<f-,(x,p)) 

lu Ju 

dju(x,p) > 

where we have used the positivity of [jl and the fact that ^_^(U) is again a Borel set by 
continuity of <l>_f . If we set [/ = R^ x R^ and use the fact that the flow is a diffeomorphism, 
we see that R^ x R^ is mapped onto itself, <l>_t(R^ x Rp = R^ x R^ and the normalization 
of leads to 

(/i(t))(R^ X Rp = dM(x,p) = d/i(x,p) = 1 



This concludes the proof. 



2.2 Quantum mechanics 

While position and momentum characterize the state of a classical particle, both are not 
simultaneously measurable with arbitrary precision in a quantum system. This is forbidden 
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by Heisenberg's uncertainty principle, 

h 

where 

A^x := v^Var^(x) := ^ ('0,(x - (i/^, x'0))2'0) 

and A^p := ^JVdixip) dace the standard deviations of position and momentum with respect 
to the state i/^. They quantify how sharply the wave function i/^ is peaked in position 
and momentum space. Either we can pinpoint the location with increased precision (re- 
duce the standard deviation A^x) or measure the particle's momentum more accurately 
2009.10.22 (reduce A^p). In contrast, the variance (and thus the standard deviation) of a classical 

observable with respect to a pure state vanishes, 

vara,,,,, (/) :=E5,„„, (Cf - = ^e,,,jn - E5,„„,(/)^ 

=f\xo,Po)-fixo,Pof = 0. 

Quantum particles simultaneously have wave and particle character: the Schrodinger 
equation 

o t 

is structurally very similar to a wave equation. H is the so-called hamilton operator (hamil- 
tonian for short) and is typically given by 

H=^(-ihV,f + V(x). 

Here m > is the mass of the particle and V the potential it has been subjected to. The 
physical constant h relates the energy of a particle with the associated wavelength and 
has units of [energy • time] . 

Pure states are described by wave functions, i. e. complex- valued, square integrable 
functions 

gl2(R^) := jV^ — ^ C 1 1/^ measurable, dx |'0(x)|^ < oo}/ - . (2.6) 
L^(R^) with the usual scalar product 

:= I dx(/>(x)*V^(x) (2.7) 
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and norm Hi/^ || := VT^/vi/^ is the prototype of a Hilbert space. ~ means we identify two 
functions if they agree almost everywhere. We will discuss this space in much more detail 
in Chapter [s] and we focus on the physical content rather than mathematics. In physics 
text books, one usually encounters the the bra-ket notation: here ^ip) is a state and (xlip) 
is i/^(x). The scalar product of G L^(R^) is denoted by {4>\ip) and corresponds to 
{<p,ip). Although bra-ket notation can be ambiguous, it is sometimes useful and in fact 
used in mathematics every once in a while. 

Physically |i/;(x, t)| is interpreted as the probability to measure a particle at time t in 
(an infinitesimally small box located in) location x. If we are interested in the probability 
that we can measure a particle in a region A c we have to integrate |i/^(x, t)| over 
A, 

P(X(t)GA)= dx |l/;(x,t)|^ (2.8) 

J A 

If we want to interpret li/^P as probability density, the wave function has to be normalized, 



1. e. 



This point of view is called Born rule: could either be a mass or charge density - or 
a probability density. To settle this, physicists have performed the double slit experiment 
with an electron source of low flux. If |i/;| were a density, one would see the whole 
interference pattern building up slowly. Instead, one measures "single impacts" of elec- 
trons and the result is similar to the data obtained from experiments in statistics (e. g. the 
Dalton board). Hence, we speak of particles. 



Quantum observables Quantities that can be measured are represented by symmetric 
(hermitian) operators A on the Hilbert space (typically L^(R^)), i. e. special linear maps 

A : ^(A) c L^(R^) — > L^(R^). 

Here, ^(A) is the domain of the operator since typical observables are not defined for 
all G L^(R^). This is not a mathematical subtlety with no physical content, quite the 
contrary: consider the observabble energy, typically given by 

H=-^(-mvj2 + y(x), 

2m 

then states in the domain 

^(H) := ji/; G l2(R^) I HxIj e l2(R^)} c l2(R^) 
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are those of finite energy. For all in the domain of the hamiltonian ^(H) c L^(R^), the 
expectation value 

{il),Hil)) < 00 

is bounded. Well-defined observables have domains that are dense in L^(R^). Similarly, 
states in the domain ^(x/) of the Zth component of the position operator are those that 
are "localized in a finite region" in the sense of expectation values. 

Physically, results of measurements are real which is reflected in the selfadjointness of 
operators/ 

The spectrum cr(H) c R is the set of all possible outcomes of measurements. Unfortu- 



nately we cannot define the spectrum now, but we will do so in Chapter 4.1 

Typically one "guesses" quantum observables from classical observables: in d = 3, the 
angular momentum operator is given by 

L = xAp = xA (— i/iV^). 

In the simplest case, one uses Dirac's recipe (replace x by x and p by p = —ihV^) on the 
classical observable angular momentum L(x,p) = x Ap. In other words, many quantum 
observables are obtained as quantizations of classical observables: examples are position, 
momentum and energy. Moreover, the interpretation of, say, the angular momentum op- 
erator as angular momentum is taken from classical mechanics. This is an indication how 
important quantizations are conceptually. 

In the definition of the domain, we have already used the definition of expectation 
value: the expectation value of an observable A with respect to a state (which we 
assume to be normalized, = 1) is given by 

E^iA):={xljM)' (2.9) 

The expectation value is finite if the state ijj is in the domain ^(A). The Born rule of 
quantum mechanics tells us that if we repeat an experiment measuring the observable A 
many times for a particle that is prepared in the state each time, the statistical average 
calculated according to the relative frequencies converges to the expectation value E^(A). 

Hence, quantum observables, selfadjoint operators on Hilbert spaces, are bookkeeping 
devices that have two components: 

(i) a set of possible outcomes of measurements, the spectrum a (A), and 

(ii) statistics, i. e. how often a possible outcome occurs. 

If for two quantum observables A and B have the same spectrum and statistics, they have 
the chance of being "equivalent." 



symmetric operator is selfadjoint if H* = H and ^(H*) = ^(H), see Chapter 4.4 
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Quantum states Pure states are wave functions g L^(R^), or rather, wave functions 
up to a total phase: just like one can measure only energy differences, only phase shifts are 
accessible to measurements. Hence, one can think of pure states as orthogonal projections 

P^:= \^)(xP\ = (^,-)^. 

if xjj is normalized to 1, Hi/^H = 1. Here, one can see the elegance of bra-ket notation vs. 
the notation that is "mathematically proper." A generalization of this concept are density 
operators p (often called density matrices): density matrices are defined via the trace. If p 
is a suitable linear operator and {v^nlneN orthonormal basis of L^(R^), then we define 

nGN 

One can easily check that this definition is independent of the choice of basis and we will 
show this in a later chapter. Clearly, has trace 1 and it is also positive in the sense that 

for all if G L^(R^). This is also the good definition for quantum states: 

Definition 2.2.1 (Quantum state) A quantum state (or density operator /matrix) p is a 
positive operator of trace 1, i. e. 

(V^,P'0) >o, 

trp = l, Vi/^GL^CR^). 

If p is also an orthogonal projection, i.e.p^ = p,itisa pure state.^ Otherwise p is a mixed 
state. 

Example Let i/^^ G L^{M.^) be two wave functions normalized to 1. Then for any < a < 1 

p = aP^^ + (1 - a)P^^ = a|'0i)('0i| + (1 - a)|'02>('02l 
is a mixed state as 

p2 = a2|Vi>(t/'il + (l-a)2|a/.2)(V2l+ 

+ a(l - a)(|a/.i)(a/.i||V2>(V'2l + l^2)(V'2ll^i)(^il) 

Even if ipi and 1/^2 orthogonal to each other, since ^ a and similarly (1 — a)^ / 
(1 — a), p cannot be a projection. Nevertheless, it is a state since trp = a + (1 — a) = 1. 
Keep in mind that p does not project on aipi + (1 - a)'02' 

^Note that the condition trp = 1 imphes that p is a bounded operator while the positivity impHes the selfad- 
jointness. Hence, if p is a projection, i. e. p^ = p, it is automatically also an orthogonal projection. 
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Time evolution The time evolution is determined through the Schrddinger equation, 

ih—xPit) = ftxPit), xPiO e L^R''), xPiO) = xPo, \\^o\\ = 1- (2-10) 

Alternatively, one can write ip[t) = U{t)ipQ with t/(0) = idi2. Then, we have 

ih^mt) = HU(t), {/(0) = idi2. 

o t 

If H were a number, one would immediately use the ansatz 

Uit) = e-'n^ (2.11) 

as solution to the Schrodinger equation. If H is a selfadjoint operator, this is still true, but 
takes a lot of work to justify rigorously if the domain of H is not all of L^(R^) (the case of 
unbounded operators, the generic case) . 

As has already been mentioned, we can evolve either states or observables in time and 
one speaks of the Schrodinger or Heisenberg picture, respectively. In the Schrodinger 
picture, states evolve according to 

^(t) = U[t)^o 

while observables remain fixed. Conversely, in the Heisenberg picture, states are kept 
fixed in time and observables evolve according to 

A(t) := UitTAUit) = e^'n^Ae-'n^. (2.12) 

Heisenberg observables satisfy 

^^Ait)='-[H,Ait)] (2.13) 

2009.10.27 which can be checked by plugging in the definition of A(t) and elementary /ormaZ manipu- 

lations. It is no coincidence that this equation looks structurally similar to equation ( |2.5| )! 

As a last point, we mention the conservation of probability : if i/^(t) solves the Schrodinger 
equation for some selfadjoint H, then we can check at least formally that the time evolu- 
tion is unitary and thus preserves probability, 

= ^((^(t),i?*i/>(t)) - (^(t),H^(t))) 
= '-{xP(tUH*-Hmt))=0. 
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Observables 

Building block observables 
Possible results of measurements 
States 
Pure states 

Generator of evolution 
Infinitesimal time evolution equation 
Integrated time evolution 



Classical 

X p 

position X and momentum p 
im(/) 

probability measures jU on phase sj 
points in phase space x 
hamiltonian function H : x - 
^/(f) = {H,/(t)} 
hamiltonian flow 0^ 



Table 2.1: Comparison of classical and quantum framework 



Conservation of probability is reminiscent of Corollary 2.1.11 We see that the condition 
=H is the key here: selfadjoint operators generate unitary evolution groups. As a mat- 
ter of fact, there are cases when one u^ants to violate conservation of proability: one has 
to introduce so-called optical potentials which simulate particle creation and annihilation. 

The time evolution e~^^^ is not the only unitary group of interest, other commonly 
used examples are translations in position or momentum which are generated by the 
momentum and position operator, respectively (the order is reversed!), as well as rotations 
which are generated by the angular momentum operators. 



2.3 Comparison of the two frameworks 

Now that we have an understanding of the structures of classical and quantum mechan- 
ics, juxtaposed in Table |2.1[ we can elaborate on the differences and similarities of both 
theories. For instance, observables form an algebra (a vector space with multiplication): 
in classical mechanics, we use the pointv^^ise product of functions, 

. : ^°°(R^ X Rp X ^°°(R^ xRp^ ^°°(R^ x Rp, (/, g) / • g 
(/ • g)ix,p) :=f(x,p) g(x,p), 

which is obviously commutative. We also admit compZex-valued functions and add complex 
conjugation as involution (i. e. /** = /). Lastly, we add the Poisson bracket to make 
*^^(R^ xRp into a so-called Poisson algebra. As we have seen, the notion of Poisson 
bracket gives rise to dynamics as soon as we choose an energy function (hamiltonian) . 
On the quantum side, bounded operators (see Chapter |4.1| ) form an algebra. This 
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algebra is non-commutative, i. e. 

Exactly this is what makes quantum mechanics dijferent. Taking adjoints is the involution 
here and the commutator plays the role of the Poisson bracket. Again, once a hamiltonian 
(operator) is chosen, the dynamics of Heisenberg observables A(t) is determined by the 
commutator of the A(t) with the hamiltonian H. If an operator commutes with the hamil- 



tonian, it is a constant of motion. This is in analogy with Definition 2.1.8 where a classical 
observable is a constant of motion if and only if its Poisson bracket with the hamiltonian 
(function) vanishes. 



2.4 Properties of quantizations 

Quantization is a problem of theoretical and mathematical physics, not a problem of na- 
ture. Quantum mechanics is simply more fundamental than classical mechanics and the 
reason we can guess the correct quantum mechanics comes from the fact that most ob- 
servables we are interested are macroscopic and have a good semiclassical limit. In a way, 
quantization is going backwards again. That is the reason why not every quantum observ- 
able has a classical analog. Spin, for instance, is purely a quantum mechanical concept 
with no semiclassical limit. 

Since quantum mechanics is more deep and classical mechanics emerges as an approx- 
imation under certain conditions, there should be "more" quantum observables than clas- 
sical observables (although it is usually not possible to qualify what we mean by "more," 
the frameworks and constructions are too different) . In the simplest case, we start with an 
algebra of classical observables and "quantize them," i. e. we associate them to operators 
on a Hilbert space in a systematic fashion. Although this is what most people think is a 
quantization, we emphasize that this is only part of what really is a consistent quantization 
procedure. 

Let us call the quantization map Dp which promotes suitable functions on phase space 
to operators on L^(R^). A number of requirements on Dp seem natural: 



Linearity The map Dp should be linear, i. e. for two classical observables /, g ^ 21^ taken 
from the algebra of classical observables and a, ^ G C, we should have 

Dp{af + 15 g) = aDpif) + 15 Dpig) e 

Here, 2tqm is an algebra of quantum observables. 



20 



2.4 Properties of quantizations 



Compatibility with involution Dp should intertwine complex conjugation and taking ad- 
joints, i. e. for all / G 21^ 

Products The two products cannot be equivalent: the operator product is noncommuta- 
tive and hence for general /, g G 21^ 

Op(/-g)^£)p(/)-£)p(g). 

Instead, for suitable functions /, g, we can define a non-commutative product tt on the level 
of functions on phase space such that 

A priori it is not at all clear whether /Jg G 21^. 

Poisson bracket and commutator Roughly, Poisson bracket and commutators play simi- 
lar roles and thus are analogs of one another. 

Just like with the product, the quantization of the Poisson bracket usually does not coin- 
cide with with i/ift times the commutator. 2009.10.28 

Small parameters One of the most important (and potentially confusing) aspects of the 
semiclassical limit is the question of the small parameter. Planck's constant ^ is a physical 
constant, i. e. its value is fixed and it has units. Any good small parameter should not 
have units, e. g. the fine structure constant a ~ V 137 15 a good small parameter. Hence, 
we cannot take the limit h ^ 0. Other interpretations of the semiclassical limit point in 
the right direction: in its earliest version, the semiclassical limit was the limit of "large 
quantum numbers." Translated to a more modern setting, this means that the ratio of 
typical energies to the energy level spacing is large. Rydberg states of hydrogen-like atoms, 
i. e. states for which the main quantum number is in the range n > 50, are probably 
the best-known example. Although this particular mechanism is limited to hamiltonians 
with non-trivial discrete spectrum, it suggests the root cause of (semi) classical behavior 
is the existence of two scales and the semiclassical parameter which from now on we will 
denote with s is related to the ratio of these scales. We give two examples: 
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(i) Separation of spatial scales. Quantum objects are typically much, much smaller 
than "classical" objects and the ratio of typical quantum to typical classical length 
scales is small. External electromagnetic fields, for instance, vary on the macroscopic 
scale, i. e. they change slowly on the microscopic scale. In solid state physics, the 
microscopic length scale is of the order of 10 — 100 A (e. g. measured in terms of the 
typical lattice spacing of a crystal or the localization length of a particle) while the 
macroscopic length scale is around 10^ — 10~^mm; hence, the separation of scales 
is approximately s < 10""^. 

(ii) Ratio of masses /Born-Oppenheimer-type systems. Born-Oppenheimer hamiltonians 
are simply many-body hamiltonians which are used to model molecules. Here, there 
are two types of particles: heavy (slow) nucleons and light (fast) electrons. The 
square root of the ratio of electronic and nucleonic mass is small, 

and this is the reason why nucleons tend to behave classically. Born-Oppenheimer 
systems will be the main focus of Chapter [s] 

However, we caution that the existence of small parameters - even in the right places - 
implies a semiclassical limit: if we take the semirelativistic limit of the Dirac equation, 
then s = ^o/c appears naturally in front of the gradient where Vq is a typical velocity and 
c is the speed of light. If Vq is well below c (the electron is much slower than the speed 
of light), no electron-positron pairs are created (since the kinetic energy is below the pair 
creation threshold of 2mc^) and electrons and positrons decouple. Hence, the ^ ^ limit 
need not have the interpretation as semiclassical limit. 

We see that the reason for different small parameters is that semiclassical behavior can 
be due to very different physical mechanisms! 

If the semiclassical parameter e is small, we expect 

Opif'g) = Opif)'Dpig) + ^i8) 

as well as 

Opi{f,g}) = ^ [£)p(/),Dp(g)] + 0(sl 

The latter equation is key in the derivation of the semiclassical limit (the Egorov Theo- 
rem 7.1) whose validity can be tested by experiment. 

For completeness, we mention that we could have quantized classical states instead of 
observables. This is the point of view of Berezin [|Ber75|] . 
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3 Hilbert spaces 



We will give a few basic facts on Hilbert spaces. This is not intended to be a replacement 
for a lecture on functional analysis. For a more in depth look on the subject, we refer to 
I1RS8T1IWE€51III0TII 



3-1 Prototypical Hilbert spaces: L^(R^) and £^(Z^) 

We have already introduced the space of square integrable functions on R^, 
^^(R'^)'= iif -.R"^ — > C I measurable, dx | (/?(x)|^ < oo j, 

when talking about wave functions. The Born rule states that |'i/;(x)|^ is to be interpreted 
as a probability density on for position. Hence, we are interested in solutions to the 
Schrodinger equation which are also square integrable. When we say integrable, we mean 
integrable with respect to the Lebesgue measure [iLLOTl p. 6 ff.]. ^2(R^) is a C-vector 
space, but 




is not a norm: there are functions 7^ for which ||(/?|| = 0. Instead, ||(/?|| = only 
ensures 

ip(x) = almost everywhere (with respect to the Lebesgue measure dx). 

Almost everywhere is sometimes abbreviated with a. e. and the terms "almost surely" 
and "for almost all x G R^" can be used synonymously. If we introduce the equivalence 
relation 

~ 1/; ||(/? — =0, 
then we can define the vector space L^(R^): 
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Definition 3.1.1 (L^CR^)) We define L^R"^) as 

where ~ is the equivalence relation that identifies ip and ijj if \\ip — xjj^^ = 0. 

If if I ~ are two normalized functions in if ^(R^), then we get the same probabilities 
for both: if A c R^ is a measurable set, then 

Pi(XGA)= dx\^,Mf= dx |(^2Wr=P2(^^A). 

J A J A 

This is proven via the triangle inequality and the Cauchy-Schwartz inequality (we will 
prove the latter in the next chapter): 



0< |Pi(XgA)-P2(XgA)| = 



I 



dx |(/?i(x)|^- dx \ip2M[ 

J A 



dx((/?i(x)-(/?2(^)) ^i(^)- dx(/?2(^)*(^i(^)-^2(-^)) 

/A J A 

< dx|(/?i(x)-(/?2(x)| |(/?i(x)| - dx|(/?2(x)| |(/?i(x)-(/?2(x)| 

J A J A 

< ||^1-^2|| ||^l|| + ||^2|| ||^1-^2|| =0 

Very often, another space is used in applications (e. g. in tight-binding models): 
Definition 3.1.2 iP{S)) Let S be a countable set. Then 

l\S) :=[c:S^c\ J].^, c*c,- < oo} 

is the space of square-summable sequences. 



2009.11.03 



On P(S) the scalar product (c,c^) := Yijes^]^'j induces the norm ||c|| := ^(cTcO. With 
respect to this norm, P(S)is complete. 



3.2 Abstract Hilbert spaces 



We will only consider vector spaces over C, although much of what we do works just fine 
if the field of scalars is R. 
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Definition 3.2.1 (Metric space) Let 3C hea set. A mapping d : 3C x 3C — > [0, +oo) with 
properties 

(i) dix,y) = exactly if x = y, 

(ii) dix,y) = diy,x) (symmetry), and 

(Hi) d(x,%) < d(x, j) + d(y,z) (triangle inequality), 

for all x,y,z ^ 3C is called metric. We refer to {3C,d) as metric space (often only denoted 
as 3C). A metric space {3C , d) is called complete if all Cauchy sequences (x„) (with respect to 
the metric) converge to some x 

A metric gives a notion of distance - and thus a notion of convergence and open sets (a 
topology) : quite naturally, one considers the topology generated by open balls defined in 
terms of d . There are more general ways to study convergence and alternative topologies 
(e. g. Frechet topologies or weak topologies) can be both useful and necessary. 

Example (i) Let 3^ he a set and define 



It is easy to see d satisfies the axioms of a metric and 3^ is complete with respect to 
d. This particular choice leads to the discrete topology. 

(ii) Let ^ = "^{[a, b], C) be the space of continuous functions on an interval. Then one 
naturally considers the metric 



with respect to which "^([a, b],C) is complete. 

A more peculiar way of measuring distances - one which is adapted to the linear structure 
of vector spaces already - are norms. 



Definition 3.2.2 (Normed space) Let 3Chea vector space. A mapping |H| : 3C — » [0, +oo) 
with properties 

(i) \\x\\ = Oifandonlyifx = 0, 

(ii) ||ax|| = |a| Wxland 
(Hi) ||x + j||<||x|| + ||j|L 





(^ooU.g)'= sup |/(x)-g(x: 



= max 

xe[a,b] 




xe[a,b] 
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for all x,y ^ dC, a G C, is called norm. The pair ||-||) is then referred to as normed 
space. 

A norm on X quite naturally induces a metric by setting 

dix,y) := \\x-y\\ 

for all x,y G X. Unless specifically mentioned otherwise, one always works with the 
metric induced by the norm. 

Definition 3.2.3 (Banach space) A complete normed space is a Banach space. 

Example (i) The space X = '^([a,b],C) from the previous list of examples has a 
norm, the sup norm 

11/ L= sup 

xe[a,b] 

Since "^([a, b],C) is complete, it is a Banach space, 
(ii) Another important example are the spaces we will get to know in Chapter [3^ 
If we add even more structure, we arrive at the notion of 

Definition 3.2.4 (pre-Hilbert space and Hilbert space) Apre-Hilbert space is a complex 
vector space with scalar product 

(•,•): X — >C, 

i. e. a mapping with properties 

(i) and {^,^) =0 implies Lp = (positive definiteness), 

(ii) {(p,^ljy = {"4^,^), and 

(Hi) {ip,aii)^x)=(^{^,'^)^{^a) 

for all if,ip,x ^ ^ and a e C. This induces a natural norm ||(/?|| := y/i^pT^ and metric 
d((/?,'0):=||(/?— J^. If is complete with respect to the induced metric, it is a 
Hilbert space. 

Example (i) C" with scalar product 

n 

1=1 

is a Hilbert space. 
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(ii) [a, b] , C) with scalar product 

{f,g):= [ dxfixTgix) 

J a 

is just a pre-Hilbert space, since it is not complete. 

3.3 Orthonormal bases and orthogonal subspaces 

Hilbert spaces have the important notion of orthonormal vectors and sequences which do 
not exist in Banach spaces. 

Definition 3.3.1 (Orthonormal set) Let be a countable index set. A family of vectors 
{^fclfcGj? ^ called orthonormal set if for all kj GJ^ 

i^k^^j) =5kj 

holds. 

As we will see, all vectors in a separable Hilbert spaces can be written in terms of a 
countable orthonormal basis. Especially when we want to approximate elements in a 
Hilbert space by elements in a proper closed subspace, the vector of best approximation 
can be written as a linear combination of basis vectors. 

Definition 3.3.2 (Orthonormal basis) Let J be a countable index set An orthonormal set 
of vectors ^ called orthonormal basis if and only if for all i/^ G we have 



k^J 



If ^ is countably infinite, = N, then this means the sequence ip^^ := of 
partial converges in norm to ijj, 



lim 

n^oo 



= 



With this general notion of orthogonality, we have a Pythagorean theorem: 

Theorem 3.3.3 (Pythagoras) Given a finite orthonormal family {(/^i, . . in a pre- 
Hilbert space 3^ and kp ^^,we have 
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Proof It is easy to check that i/^ := Yik=i^^k> ^) ^nd xjj^ •= - XiLi^^fc' ^) ^fc 
orthogonal and = i/^ + 1/^-^. Hence, we obtain 

This concludes the proof. □ 
A simple corollary are BesseFs inequality and the Cauchy-Schwarz inequality 
Theorem 3.3.4 Let J>f be a pre-Hilbert space. 

(i) BesseVs inequality holds: Zet be a finite orthonormal sequence. Then 

n 

holds for allxpej^f. 

(ii) The Cauchy-Schwarz inequality holds, i. e. 

\{^,^P)\<\mm 

is valid for allif,xpej>^ 
Proof (i) This follows trivially from the previous Theorem as Hi/^-^lP > 0. 
(ii) Pick ^ ^ . In case = 0, the inequality holds. So assume / and define 

which has norm 1. We can apply (i) for n = 1 to conclude 

This is equivalent to the Cauchy-Schwarz inequality. □ 

An important corollary says that the scalar product is continuous with respect to the norm 
topology. This is not at all surprising, after all the norm is induced by the scalar product! 

Corrolary 3.3.5 Let ^bea Hilbert space. Then the scalar product is continuous with respect 
to the norm topology, i. e. for two sequences (^„)neN ('0m)meN ^^^^ converge to if and 
il), respectively, we have 

lim {iPn,'^m) = {^^'^)' 
n,m—*oo 
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Proof Let i^n)n&N ('0m)meN two sequeiices in that converge to and ijj, re- 
spectively. Then by Cauchy-Schwarz, we have 

lim \{'P,ll^) - {'Pn^ll^m)\= |(^ - V^n.'^) - (^n.'0m-'0)| 

< Hm I '0)1+ lim \{(p ^,11^^-11^)1 

< lim ||<^-<^J| 11^11+ lim M\\xl,^-xl,\\=0 

n,m^oo n,m-^oo 

since there exists some C > such that || || < C for all n G N. □ 

Definition 3.3.6 (Separable Hilbert space) A Hilbert space Jif is called separable if there 
exists a countable dense subset 

Before we prove that a Hilbert space is separable exactly if it admits a countable basis, we 
need to introduce the notion of orthogonal complement: if A is a subset of a pre-Hilbert 
space Jif , then we define 

A-^ I {ip,ilj) = 0\filj gA}. 

The following few properties of the orthogonal complement follow immediately from its 
definition: 

(i) {0}^ = J^andJf^ = {0}. 

(ii) is a closed linear subspace of for any subset A c jf, 

(iii) IfACB, thenB-^ CA"^. 

(iv) If we denote the sub vector space spanned by the elements in A by span A, we have 

A"^ = (span A) = (spanA)"^ 

where spanA is the completion of spanA with respect to the norm topology 

If is a metric space, we can define the distance between a point G and a 

subset AC as 

d((/?,A) := inf d((/?,'0). 

If there exists E:A which minimizes the distance, i. e. d((/?,A) = d((/?, ^q), then (^q is 
called element of best approximation for (/? in A. This notion is helpful to understand why 
and how elements in an infinite-dimensional Hilbert space can be approximated by finite 
linear combinations - something that is used in numerics all the time. 

If A c Jif is a convex subset of a Hilbert space Jif , then one can show that there always 
exists an element of best approximation. In case A is a linear subspace of Jif , it is given by 
projecting an arbitrary ijj GJf down to the subspace A. 
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Theorem 3.3.7 Let Abe a closed convex subset of a Hilbert space Jif. Then there exists for 
each G Jif exactly one ^ A such that 

Proof We choose a sequence ('^n)^^^ with d{i^,ipj^) = ||(/? — V^/iH d{x,A). This 
sequence is also a Cauchy sequence: we add and subtract if to get 

If were a normed space, we could have to use the triangle inequality to estimate the 
right-hand side from above. However, isa Hilbert space and by using the parallelogram 
identity,^ we see that the right-hand side is actually equal to 



=2||i/^,-(/p|| +2||i/;^-(/p|| -||i/;, + i/;^-2(/p|| 



<2\\^lj,-iff + 2\\^lj^-iff-4diif,A) 
2d((/?,A) + 2diip,A) - 4d((/?,A) = 0. 



By convexity, |('i/^n + '^m) is again an element of A This is crucial once again for the 
uniqueness argument. Letting n,m ^ oo, we see that is ^ Cauchy sequence in 

A which converges in A as it is a closed subset of Jif . Let us call the limit point (/?o -= 
lim^^oQ tpj^. Then (^q is an element of best approximation, 

|k-^o|| = lim||(/?-'0„|| =d((/?,A). 

To show uniqueness, we assume that there exists another element of best approximation 

G A. Define the sequence ('iprJneN W '02n '— ^^en indices and '02n+i •— 

for odd indices. By assumption, we have ||(/? — (/?o|| — d(ip,A) = ||(/? — and thus, by 
repeating the steps above, we conclude ('0n)neN is ^ Cauchy sequence that converges to 
some element. However, since the sequence is alternating, the two elements (^^ = ipQ are 
in fact identical. □ 

As we have seen, the condition that the set is convex and closed is crucial in the proof. 
Otherwise the minimizer may not be unique or even contained in the set. 

Corrolary 3.3.8 Let E be a closed subvector space of the Hilbert space J^. Then for any 
2009.11.04 (/? G Jt^, there exists ^q^E such that d(if,E) = d((/?, (/?o). 



^For all c/?,!/) e^, the identity 2 ILlP -h 2 lUlP = ||(/? + i/^ll^ + ||(/? - i/^ll^ holds. 
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3.3 Orthonormal bases and orthogonal subspaces 



This is all very abstract. For the case of a closed subvector space E c we can express 
the element of best approximation in terms of the basis: not surprisingly, it is given by the 
projection of if onto E. 

Theorem 3.3.9 Let E ^ ^ be a closed subspace of a Hilbert space that is spanned by count- 
ably many orthonormal basis vectors {'^k}kej^' Then for any if G J^, the element of best 
approximation Lp^G E is given by 



^0 = '^{^k,^)^k' 



kej^ 



Proof It is easy to show that Lp - Lp^ is orthogonal to any ifj = Xifcej? ^k^k^ focus 
on the more difficult case when E is not finite dimensional: then, we have to approximate 
(/?o and by finite linear combinations and take limits. We call ip^^-^ := ^^^^i^k^ ^) ^k 
and i/;^"'^ := YlLi ^i- ^^^^ ^^^^^ ^^^^ 



{ip - ip^^\ilj^'^^) = (ip -JTk=ii^k,^) '^k^TZi^i 



m n m 

■■^Xi {if, ipi) - XI XI ^^^^ ^^fc^ 

1=1 k=l 1=1 



m 



1=1 



By continuity of the scalar product. Corollary 3.3.5 we can take the limit n,m ^ oo. The 
term in parentheses containing the sum is exactly when Z G {1, . . . , m} and 1 otherwise. 
Specifically if n > m, the right-hand side vanishes identically. Hence, we have 

(<^-^o,V')= lim =0, 

n,m— >oo 

in other words if — ip^ G E^. This, in turn, implies by the Pythagorean theorem that 

Ik ~ '^ir " Ik ~ ^oir + iko - '^ir ^ Ik - ^oir 

and hence || — V^o|| — diLp,E). Put another way, Lp^ is an element of best approximation. 
Let us now show uniqueness. Assume, there exists another element of best approximation 
^0 ~ Sfcej^ ^k ^fc- Then we know by repeating the previous calculation backwards that 
ip — ip^Q G E^ and the scalar product with respect to any of the basis vectors ip^ which span 
E has to vanish. 



/ , - Wk^ ^/ - /_^^i^kl 



= {^k,^)-K 



k- 



31 
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This means the coefficients with respect to the basis {^k}kGj^ agree with those of lpq. 
Hence, the element of approximation is unique, ifQ = ip^, and given by the projection of 
if onto E. □ 

Theorem 3.3.10 Let E be a closed linear subspace of a Hilbert space J^. Then 

(i) = E ® E\ i. e. every vector if G ^ can be uniquely decomposed as if = xjj -{-xjj^ 
with ip gE,iI)^ G E^. 

(ii) E^^ = E. 



Proof (i) By Theorem [3.3.71 for each G J^, there exists (fo^E such that d{i^,E) = 
d(ip,ipQ). From the proof of the previous theorem, we see that ip^ := if — ifQ G E-^. 
Hence, if = ip^ -\- ip^ is a decomposition of To show that it is unique, assume 
^0 = ^ = ^0 + ^0 another decomposition. Then by subtracting, we are led 

to conclude that 

E^^o-^o = ^o-^o^ ^ E^ 

holds. On the other hand, E n E-^ = {0} and thus lpq = Lp^ and Lp^ = (p^'^, the 
decomposition is unique. 

(ii) It is easy to see that E c Let (p G E-^-^. By the same arguments as above, we 
can decompose (p G E^-^ c into 

with (po^E ^ E^^ and (p^ e E^. Hence, (p - (po e E^^ n E^ = (E^)^ nE^ = {0} 
and thus (p = (p^eE. □ 

Now we are in a position to prove the following important Proposition: 

Proposition 3.3.11 A Hilbert space ^ is separable if and only if there exists a countable 
orthonormal basis. 

Proof <=: The set generated by the orthonormal basis ^ countable, and coeffi- 

cients z = q + ip, q,p G Q, is dense in J^, 



Assume there exists a countable dense subset ^, i. e. ^ = J^. If is finite dimen- 
sional, the induction terminates after finitely many steps and the proof is simpler. Hence, 
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3.4 Subspaces (®) and product spaces ((S>) 



we will assume to be infinite dimensional. Pick a vector (pi G ^\{0} and normalize 
it. The nor malized vector is then called ipi. Note that ipi need not be in ^. By Theo- 
we can split any ip ^ ^ into ipi and ip-^ such that ip^ G span{(/?i} := E^, 
^i}'^ := and 



rem 



3.3.10 



i/^^ G span 



pick a second (^2 ^ ^ \ (which is non-empty). Now we apply Theorem 3.3.9 (which 
is in essence Gram-Schmidt orthonormalization) to (p2, i- we pick the part which is 
orthogonal to Lpi, 

and normalize to if 2, 



^2 '= 



^2 

W2\\' 



This defines E2 := span ip2} and = E2 ® ^2- 

Now we proceed by induction: assume we are given E„ = span .... Take 
^n+i ^ ^ \ ^i^d apply Gram-Schmidt once again to yield ifn+i which is the obtained 
from normalizing the vector 



k=l 

This induction yields an orthonormal sequence { V^nlneN which is by definition an orthonor- 
mal basis of E^ := span ^^^^ a closed subspace of J^. If E^ c we can split the 
Hilbert space into ^ = E^® E^. Then either ^ n (J^ \ E^) = - in which case ^ can- 
not be dense in - or ^ n (J^ \E^) ^ Q. But then we have terminated the induction 
prematurely. □ 



3.4 Subspaces (0) and product spaces (0) 

There are several ways to split Hilbert spaces: in direct sums and direct products. With 
the same techniques, we can construct new ones from existing Hilbert spaces. In The- 
orem 3.3.10 we have shown that if E is a closed subspace, then Jif decomposes into a 



direct sum 



That means any vector (/? = 1/^ + 1/;^ GJ^ can be uniquely decomposed into G E and 
1/;-^ G E^. We now define the direct sum of two Hilbert spaces: 
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Definition 3.4.1 (Direct sum ®) Let and. Hilbert spaces with scalar products 

and {',')2- Then we define ^he carteisan product x of vector spaces 

endowed with the structure of a vector space in the following way: for any ^ = (^i, if 2), = 
'02) ^ ^1 X Jif2 aeC,we define 

(i) addition component-wise, + = (^1, ^^2) + ('0i^'02) (^1 + '0i^ ^2 + '02^ 

(ii) scalar multiplication component-wise, aip = a(ipi,ip2) := iaipi,aip2\ 
(Hi) and the scalar product on ® J^2 of the two scalar products, 

(if^ip) := {if 1,^1)1 + (^2. '02)2- 
Proposition 3.4.2 The direct sum Jt^ ^2 of two Hilbert spaces is a Hilbert space. 

Proof One immediately checks that Jt^ ^^2 is a vector space. Completeness also fol- 
lows from the completeness of the components: let {(/^^"''IneN t>e a Cauchy sequence with 
respect to the norm induced by (•,•). Writing out the definition, it is clear that this also 
means each component {(/^j^^lneN? j — 1^2, is a Cauchy sequence in which converges 
to some ifj G Hence, (/?^"^ — > (ipi, if 2) as n ^ 00. □ 

2009.11.10 The other way to construct new Hibert spaces is taking tensor products: liifi G Jt^ and 

if 2 G J^2 are two vectors from two vector spaces, we can characterize if if 2, the tensor 
product of if I and if 2, by the following defining properties: for any ifiyij^i G J^^ and 

if2,1p2^^2 

^1 <^ (^2 + '02) = ^1 <S) (/?2 + ^1 <^ '02 
{if I + V^i) (/?2 = ^1 <^ ^2 + '01 <^ ^2 

holds. Scalars can be pushed back and forth between factors, 

^ ^2) = («^l) ^f2 = ^l^ («^2)- 

The formal definition is a lot more complicated: one has to construct a big space where 
vectors such as (aifi) (f2 and (fi (a(/?2) are distinct and then use equivalence rela- 
tions to implement the above characteristics. The constructed space is defined only up to 
isomorphism. 

Definition 3.4.3 (Tensor product 0) Let and ^2 ^0 Hilbert spaces with scalar 
products (•, ')i and (•, •)2. Then the tensor product Jt^ J^2 defined as the completion of 
the algebraic tensor product 

^1 ^2 '= {SLi^fc^ifc ^^2k\rLeN,if^j,e if 2k g ^2, ^fc ^ C VI < /c < n} 
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3.5 Linear functionals, dual space and weak convergence 



with respect to the norm induced by the scalar product 

If {^ifclfcej^i ^i^d {^2; }jej^2 orthonormal bases of two separable Hilbert spaces and 
J^2? respectively, then 

Wlk^^2j}kej^^jej^^ 

is an orthonormal basis in Jif^ J^2^ i- ^- every vector ^ G ^2 can be written as 

XI(^lfc^^2;.^)^lfc<^^2r 

Example (i) A non-relativistic spin-V2 particle lives in the Hilbert space L^(R^,C^). 
An easy but very helpful exercise is to show the following equivalence (which cor- 
respond to different physical points of views) : 

Depending on the physical situation, these identification may be very helpful in solv- 
ing a problem. 

(ii) Consider L^iR"^) L^(W^). This is the Hilbert space of two particles. If they are 
identical, we have to restrict ourselves to the symmetric and antisymmetric subspace, 
depending on whether the particle in question is a boson or a fermion. Keep in mind 
that in general, elements ^ G L^(R^) L^(R^) cannot be written as the product of 
two wave functions ipi, ^2 ^ ^^(R^), 

^/(/?10(/?2. 

We will show in an exercise that L\M.^) L^CR^) = L\M.^ x R^). 

3.5 Linear functionals, dual space and weak convergence 

A very important notion is that of a functional. We have already gotten to know the free 
energy functional 

■^free :^(Efree) c L\n^) [0,+oo) C C, 

1 ^ 

^ ^ Ef,^ei^) = — ^{(-ihd^^ifli-ihd^^if)). 

This functional, however, is not linear, and it is not defined for all G L'^(R^). Let us 
restrict ourselves to a smaller class of functionals: 
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Definition 3.5.1 (Bounded linear functional) Let 3C he a normed space. Then a map 

L :^ — 

15 a bounded linear functional if and only if 

(i) there exists C > such that |L(x)| < C ||x|| and 

(n) L(ix + ^y) = Ux)-\-^L(y) 
hold for all x,y e 3^ and [i e C. 

A very basic fact is that boundedness of a linear functional is equivalent to its continuity. 

Theorem 3.5.2 Let L : 3C — » Che a linear functional on the normed space 3C. Then the 
following statements are equivalent: 

(i) L is continuous at XqG5^. 

(ii) L is continuous. 
(Hi) L is hounded. 

Proof (i) ^ (ii) : This follows immediately from the linearity. 

(ii) ^ (iii): Assume L to be continuous. Then it is continuous at and for £: = 1, we can 
pick 5 > such that 

\Liix)\<8 = l 

for all X G ^ with ||x|| < 5. By linearity this implies for any j G ^ \ {0} that 
Hence, L is bounded with bound ys, 

|W|<i||y||. 

(iii) ^ (ii) : Conversely if L is bounded by C > 0, 

|l(x)-l(j)|<c||x-j||, 

holds for all x, j G This means, L is continuous: for ^ > pick 5 = so that 

\L(ix)-L(iy)\<c\\x-y\\<C^ = 8 
holds for all x,y such that llx - jll < ^/c. □ 



36 



3.5 Linear functionals, dual space and weak convergence 



Definition 3.5.3 (Dual space) Let 3C he a normed space. The dual space 3^* is the vector 
space of bounded linear functionals endowed with the norm 

ll^ll* sup — -— = sup |L(x)| . 

llxINi 

Independently of whether 3C is complete, is a Banach space. 

Proposition 3.5.4 The dual space to a normed linear space ^ is a Banach space. 

Proof Let be a Cauchy sequence in , i. e. a sequence for which 

II II fcj^CX) 

We have to show that {Ln)neN converges to some L G ,^*. For any £: > 0, there exists 
N{e) e N such that 

\\^k-Lj\l<s 

for all kj > N(s). This also implies that for any x G i^nM)nGN converges as well, 

\L^M-Ljix)\<\\L^-Lj\l ||x||<^||x||. 

The field of complex numbers is complete and (^n(^))nGN converges to some L(x) G C. 
We now define 

L(x) := lim L„(x) 

n^oo 

for any x G 3^. Clearly, L inherits the linearity of the (Ljjj^^^. The map L is also bounded: 
for any ^ > 0, there exists iV(^) G N such that - L„||^ < s for all n > iV(^). Then 

|(L-Lj(x)|= Hm|(L^.-Lj(x)|< lim - L„||J|x|| 

<s\\x\\ 

holds for all n > N(e). Since we can write L as L = L„ + (L — L„), we can estimate the 
norm of the linear map L by ||L||^ < ||L„||^ + £: < oo. This means L is a bounded linear 
functional on □ 

In case of Hilbert spaces, the dual Jif * can be canonically identified with itself: 

Theorem 3.5.5 (Riesz' Lemma) Let J>^bea Hilbert space. Then for all L G Jif* there exist 
ipi^^ such that 

In particular, we have \\L\\^ = WxjJiW. 
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= aI(yo) II,.' 112 =«^(yo)- 



Proof Let kerl :={(^e^|I((^) = 0}be the kernel of the functional I and as such is a 
closed linear subspace of If kerl = M', then e ^ is the associated vector, 

I((^) = 0=(0,(/?>. 

So assume kerL c ^ is a proper subspace. Then we can split = kerL ® (kerL)""". Pick 
(fQ e (kerL)-*-, i. e. L{^q} ^ 0. Then define 

We will show that L((/?) = (V^i,^). If ^ ^ kerL, then L((/?) = = (V^i,^). One easily 
shows that for ip = a a g C, 

= L(a(/?o) = «^(^o) 
ll^olP 

Every e ^ can be written as 

Then the first term is in the kernel of L while the second one is in the orthogonal comple- 
ment of kerL. Hence, L((/?) = (V^i, ^) for all ^ G J^. If there exists a second G Jif, 
then for any G Jif 

= Uif) - Uip) = {xIjl, if) - ^) = {^Pl - if). 

This implies = V^l and thus the element i/^^ is unique. 
To show ||L||^ = IIV^lIL assume L / 0. Then, we have 

l|I|U= sup|L(v.)|>|L(^)| 

\W\H 

On the other hand, the Cauchy-Schwarz inequality yields 

l|i||*= sup = sup kV-L,*^)! 

Ilv||=i M=l 

< sup ||^i||||V'|| = ||Vll|. 

Ikll=l 

Putting these two together, we conclude IlLll^^Hi/^iH. □ 
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3.5 Linear functionals, dual space and weak convergence 



Remark 3.5.6 The bidual of a Hilbert space J^** can be canonically identified with 
itself, i. e. Hilbert spaces are reflexive. 

Definition 3.5.7 (Weak convergence) Let 3^ be a Banach space. Then a sequence (x„)„eN 
in 3^ is said to converge weakly to x if for all L G 3^"^ 

L(x) 

holds. In this case, one also writes x„ x. 

Weak convergence, as the name suggests, is really weaker than convergence in norm. The 
reason why "more" sequences converge is that, a sense, uniformity is lost. If is a Hilbert 
space, then applying a functional is the same as computing the inner product with respect 
to some vector i/^^. If the "non-convergent part" lies in the orthogonal complement to 
{ipi}, then this particular functional does not notice that the sequence has not converged 
yet. 

Example Let Jt^ be a separable infinite-dimensional Hilbert space and {^nlne^ ortho- 
normal basis. Then the sequence {,<^n)nm ^o^s not converge in norm, for as long as n / 

||^n-^fc|| = 

but it does converge weakly to 0: for any functional L = {ip^, •), we see that (|^(v^n)|)neN 
is a sequence in R that converges to 0. Since {(/^nlneN is a basis, we can write 

00 

and for the sequence of partial sums to converge loipi, the sequence of coefficients 

must converge to 0. Since this is true for any L G Jif*, we have proven that ^ 
(i. e. weakly). 

In case of ^ = L^(R^), there are three basic mechanisms for when a sequence of 
functions {f{) does not converge in norm, but only weakly: 

(i) fy. oscillates to death: take fj^(x) = sm(kx) for < x < 1 and zero otherwise. 

(ii) goes up the spout: pick g G L^(R) and define f^M := k^^^ g(kx). This sequence 
explodes near x = for large k. 

(iii) fy. wanders off to infinity: this is the case when for some g G I^(M), we define 
/fcW:=g(x + /c). 

All of these sequences converge weakly to 0, but do not converge in norm. 
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3.6 Important facts on L^CR"^) 

For future reference, we collect a few facts on L^(R^) spaces. In particular, we will make 
use of dominated convergence frequently. We will give them without proof, they can be 
found in standard text books on analysis, see e. g. [LLQl J. 

Definition 3.6.1 (L^(R^)) Let 1 < p < oo. Then we define 

^P(Rd) |/ : ^ c I / measurable, dx < oo} 

as the vector space of functions whose pth power is integrable. Then L^(R^) is the vector 
space 

consisting of equivalence classes of functions that agree almost everywhere. With the p norm 

11/11,:= (£dx|/(x)r)'' 

it forms a normed space. 

In case p = oo, we have to modify the definition a little bit. 
Definition 3.6.2 (L^(R^)) We define 

if := |/ : — > c | / measurable, 30 < < oo : |/(x)| < K almost everywhere^ 

to be the space of functions that are bounded almost everywhere and 

\\f\\^ :=ess sup|/(x)| : = inf {i^C > | |/(x)| < K for almost all x ^R"^}. 

Then the space L°°(R^) := ^^(R^)/ ~ is defined as the vector space of equivalence classes 
where two functions are identified if they agree almost everywhere. 

Theorem 3.6.3 (Riesz-Fischer) For any 1 < p < oo, L^(R^) is complete with respect to 
the II 'lip norm and thus a Banach space. Ifp = 2, l2(R^) is also a Hilbert space with scalar 
product 

{f,g)= [ dxfixfgix). 
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3.6 Important facts on L^CR"^) 



Theorem 3.6.4 For any 1 <p <oo, the Banach space L^(R^) is separable. 

Proof We refer to IILLOli Lemma 2.17] for an explicit construction. The idea is to approx- 
imate arbitrary functions by functions which are constant on cubes and take only values 
in the rational complex numbers. □ 

Theorem 3.6.5 (Monotone Convergence) Let ifk)kGN ^ sequence of non- decreasing 
functions in L^(R^) with pointwise limit f defined almost everywhere. Define Ij^ := J^^ dx/;^(x); 

then the sequence (4) is non-decreasing as well. If I := lim^^ >ooh < ^hen I = 

j^, dxfixl i. e. 



lim dxf]^(x)= dx lim f^(x)= \ dx/(x) 



holds. 



Theorem 3.6.6 (Dominated Convergence) Let ifk)kef^ ^ sequence of functions in L^(R^) 
that converges almost everywhere pointwise to some f : — > C. If there exists a non- 
negative g G L'^(R^) such that |/fc(x)| < g(x) holds almost everywhere for all /c G N, then g 
also bounds |/|, i. e. |/(x)| < g(x) almost everywhere, and f G L'^(R^). Furthermore, the 
limit k — > 00 and integration with respect to x commute and we have 

lim dx/j^(x)= dx lim fj^(x)= dx/(x). 

k ^00 Ld Ld k ^00 Ld 
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4 Linear operators 



We will give a rough overview of operator theory. The most important section is that on 
selfadjointness which illuminates the connection between unitary evolution groups and 
self adjoint operators. 



4.1 Bounded operators 

The simplest operators are bounded operators. 

Definition 4.1.1 (Bounded operator) Let 3C and ^ he normed spaces. A linear operator 
T \5C — >^ is called hounded if there exists M > with ||rx||^ < M ||x||^. 

Just as in the case of linear functionals, we have 

Theorem 4.1.2 Let T : 5C — > ^ he a linear operator hetween two normed spaces 5C and 
^. Then the following statements are equivalent: 

(i) T is continuous at Xq g 

(ii) T is continuous. 
(Hi) T is hounded. 



Proof We leave it to the reader to modify the proof of Theorem 3.5.2 



We can introduce a norm on the operators which leads to a natural notion of convergence: 



Definition 4.1.3 (Operator norm) LetT \ 5C — > ^hea hounded linear operator hetween 
normed spaces. Then we define the operator norm of T as 

\\T\\:= sup llTxIl^. 

\\x\\=l 

The space of all hounded linear operators hetween ^ and ^ is denoted hy 
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One can show that ||r|| coincides with 

inf{M > I llTxIl^ < M llxll^ Vx G ^} = ||r|| . 

The product of two bounded operators T G mi'S^,^) and S G mi5^,'3^) is again a 
bounded operator and its norm can be estimated from above by 

l|rs||<||r||||s||. 

If^ = X = 2^, this implies that the product is jointly continuous with respect to the norm 
topology on X. For Hilbert spaces, the following useful theorem holds: 

Theorem 4.1.4 (Hellinger-Toeplitz) Let Abe a linear operator on a Hilbert space ^ with 
dense domain ^(A) such that (i/^,A(/?) = (Ai/;, holds for all Lp,il^ e ^(A). Then ^(A) = 
if and only if A is bounded. 

Proof <=: If A is bounded, then ||A(/?|| < M ||(/?|| for some M > and all if ^ ^ by 
definition of the norm. Hence, the domain of A is all of J^. 

This direction relies on a rather deep result of functional analysis, the so-called Open 
Mapping Theorem and its corollary, the Closed Graph Theorem. The interested reader 
may look it up in Chapter III. 5 of [RS81t] . □ 

Let T,S be bounded linear operators between the normed spaces 3^ and If we define 

(r + S)x := Tx + Sx 

as addition and 

(A - r)x :=XTx 

as scalar multiplication, the set of bounded linear operators forms a vector space. 

Proposition 4.1.5 The vector space , ^) of bounded linear operators between normed 
spaces ^ and ^ with operator norm forms a normed space. If ^ is complete, ^(^, ^) is a 
Banach space. 

Proof The fact ^(^,'3^) is a normed vector space follows directly from the definition. 
To show that ^) is a. Banach space whenever ^ is, one has to modify the proof of 



Theorem 3.5.4 to suit the current setting. This is left as an exercise. 



Very often, it is easy to define an operator T on a "nice" dense subset ^ c Then the 
next theorem tells us that if the operator is bounded, there is a unique bounded extension 
of the operator to the whole space 3^. For instance, this allows us to instantly extend the 



Fourier transform from Schwartz functions to L (R ) functions (see Proposition 5.1.9). 
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4.1 Bounded operators 



Theorem 4.1.6 Let ^ ^ 3C he a dense subset of a normed space and W he a Banach space. 
Furthermore, let T : Q) — > W he a hounded linear operator Then there exists a unique 
hounded linear extension f : 3^ — > W and \\f\\ = ||r||^. 

Proof We construct f explicitly: let x G be arbitrary. Since Qi is dense in 3^, there 
exists a sequence (x„)„eN ^ which converges to x. Then we set 

fx := lim rx„. 

n ^-oo 

First of all, f is linear. It is also well-defined: (Tx„)„ep^ is a Cauchy sequence in 

||rx„-rxfc||^< ||r||^||x„-Xfc||^ ^oo^ 

where the norm of T is defined as 

11^11^:= sup 

xGS!\{o} W^Ws^ 

This Cauchy sequence in ^ converges to some unique y G as the target space is com- 
plete. Let (x^)„eN t>e a second sequence in ^ that converges to x and assume the sequence 
(rx^)„eN converges to some G We define a third sequence (^„)„eN which alternates 
between elements of the first sequence (x„)„eN ^^e second sequence (<^^)nep^j i« g. 

for all n G N. Then also converges to x and (r^„) forms a Cauchy sequence that 

converges to, say, C G Subsequences of convergent sequences are also convergent and 
they must converge to the same limit point. Hence, we conclude that 

= lim = lim r%2n = 1™ '^^n — J 

n )"oo n ^'oo n )-oo 

= lim rz2„-i= lim Tx'^=y' 

n ^oo n ^oo 

holds and fx does not depend on the particular choice of sequence which approximates 
X in Qi. It remains to show that ||f || = ||r||^: we can calculate the norm of f on the dense 
subset ^ and use that f |^ = T to obtain 

||f||= sup||fx||= sup sup 



xeS^ xG^\{0} 11^ II xG^\{0} 11^ II 

llxINi 

llTxIl 

= sup . 

XG^\{0} \M\ 

Hence, the norm of the extension f is equal to the norm of the original operator T. 
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The spectrum of an operator has been related to the set of possible outcomes of measure- 
ments (if the operator is selfadjoint). 

Definition 4.1.7 (Spectrum) Let T e be a hounded linear operator on a Banach 

space ^. We define: 

(i) The resolvent of T is the set p(T) := {X G C \ T — Aid is bijective}. 

(ii) The spectrum (j(T) := C\ p[T) is the complement of p{T) in C 

One can show that for all A G p(r), the map (T — Aid)~^ is a bounded operator and the 
spectrum is a closed subset of C. One can show its <j{T) is compact and contained in 
{AgCI |A|<||r||}cC. 



4.2 Adjoint operator 

If is a normed space, then we have defined c^*, the space of bounded linear functionals 
on 3^. If T : 3^ — > is a bounded linear operator between two normed spaces, it 
naturally defines the adjoint operator T' \^'' — > ^* via 

[T'L^x^'.^LiTx) (4.1) 

for all X G ^ and L G "S^* . In case of Hilbert spaces, one can associate the Hilbert space 
adjoint. We will almost exclusively work with the latter and thus drop "Hilbert space" 
most of the time. 

Definition 4.2.1 (Hilbert space adjoint) Let ^ he a Hilbert space and A e ^(J^) be a 
bounded linear operator on J^. The antilinear isomorphism C : — > J^* taken from 



Theorem 3.5.5 maps functionals on ^ onto the corresponding vectors, i. e. Cip := •) = 
L^. Then the Hilbert space adjoint is defined as 



or put differently 
2009.11.17 for all if,xlje^. 



Proposition 4.2.2 LetA,B G ^(J^) be two bounded linear operators on a Hilbert space ^ 
and a G C. Then, we have: 

(i) (A + B)*=A*+B* 
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(ii) iaAy = a'^A'^ 

(in) (.4B)*=B*A* 

(iv) \m = \\A\\ 

(v) A** =A 

(vi) ||A*A|| = ||A4*|| = ||A||^ 

(vii) kerA = (imA*)\ kerA* = (imA)-^ 

Proof Properties (i)-(iii) follow directly from the defintion. 
To show (iv), we note that ||A|| < ||A*|| follows from 



IKII = 



/ 

\ WAifW 



,Alp 



= sup |l(A(/?)| 

llill*=l 



= sup |(A*i/;^,(/p)|<||a*|||L|| 

where in the step marked with we have used that we can calculate the norm from 
picking the functional associated to jj^: for a functional with norm 1, \\L\\^ = 1, the 
norm of L(Aip) cannot exceed that ofAip 

= < ||^J|||A<^|| = ||A</>||. 



Here, i/^^, is the vector such that L = {ipi,-) which exists by Theorem 3.5.5 This theorem 
also ensures IlLll^^lli/^^H.On the other hand, from 



sup \{A*xP^,^)\ 



k =1 



< sup ||t/'i||||A(/.|| = ||A||||L|U = ||A||||^J 

lkll=i 



we conclude ||A*|| < ||A||. Hence, ||A*|| = ||A||. 
(v) is clear. For (vi), we remark 



\\Af= sup \\A^f= sup ((^,A*A(^) 
lkll=i lkll=i 

< sup ||a*A(/j|| = \\a*a\\ . 



This means 



||A||2< A*A < ||A* II ||A|| = ||A||^ 
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which combined with (iv), 

\\Af = ||A*f < ||A4*|| < ||A|| ||a*|| = ||A||^ 
implies ||A*A|| = ||A||^ = ||AA*||. (vii) is left as an exercise. □ 
Definition 4.2.3 Let ^bea Hilbert space and A G ^(^). Then A is called 

(i) normal i/A*A = AA*. 

(ii) selfadjoint (or hermitian) if A* =A 
(Hi) unitary if A*A = id^ = AA*. 

(iv) an orthogonal projection if = A and A* = A 

(v) positive if {ip,Aip) > for all if e J^. 

4.3 Unitary operators 

Unitary operators U have the nice property that 

{Uif,Uxlj) = {if,U''UxP) = {if,xlj) 

for all (/Jji/^ G In case of quantum mechanics, we are interested in solutions to the 
Schrodinger equation 

for a hamilton operator which satisfies H* = H. Assume that H is bounded (this is really 
the case for many simple quantum systems) . Then the unitary group generated by H, 

can be written as a power series, 

n=0 

where := id by convention. The sequence of partial sums converges in the operator 



norm to e 



n=0 
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since we can make the simple estimate 

00 2 00 2 00 2 



n=0 



n=0 

M 



n=0 



< 00. 



This shows that the power series of the exponential converges in the operator norm inde- 
pendently of the choice of i/^ to a bounded operator. Given a unitary evolution group, it 
is suggestive to obtain the hamiltonian which generates it by deriving l/(t)'0 with respect 
to time. This is indeed the correct idea. The left-hand side of the Schrodinger equation 
(modulo a factor of i) can be expressed as a limit 

^Mt)= lim i(i/.(t + 5)-i/;(t)). 
at 5 — yo ° 

This limit really exists, but before we compute it, we note that since 



l)V'o> 



it suffices to consider differentiability at t = 0: taking limits in norm of J^, we get 



1 f ^ 

:i/.(0)= lim |(i/.(5)-i/.o)= ^ X] 

. 5 — yo ° 5 — >o ^ * 

= lim V^^5"-iH"a/.o = -iH^o- 



(-0" 



n=0 



^►0^ n! 

n=l 

Hence, we have established that e~^^^'0o solves the Schrodinger condition with 1/^(0) = 

i^V^(t) = HV^(t). 

However, this procedure does not work if H is unbounded (i. e. the generic case)! Before 
we proceed, we need to introduce several different notions of convergence of sequences 
of operators which are necessary to define derivatives of [/(t). 

Definition 4.3.1 (Convergence of operators) Let A„ e ^(J^) be a sequence of bounded 
operators. We say that the sequence converges to ^ ( J^) 

(i) uniformly /in norm if lim„ )-oo||^n ~^|| — 0- 

(ii) strongly if lim„ lU^i/^ -Ai/^ 11 = /or aZZ i/^ G Jif. 
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(Hi) weakly if limj^ y^{^f,Aj^ilj-Ailj) = for all ^p^ip e J^. 

Convergence of a sequence of operators in norm implies strong and weak convergence, 
but not the other way around. In the tutorials, we will also show explicitly that weak 
convergence does not necessarily imply strong convergence. 

Example With the arguments above, we have shown that if H = H* is selfadjoint and 
bounded, then t e~^^^ is uniformly continuous. 

If ||H|| = 00 on the other hand, uniform continuity is too strong a requirement. If H = 
— is the free Schrodinger operator on L^(R^), then the Fourier transform ^ links the 
position representation on L^(R^) to the momentum representation on L^(R^). In this 
representation, the free Schrodinger operator H simplifies to the multiplication operator 

acting on L^(R^). More elaborate mathematical arguments show that for any t G R, the 
norm of the difference between = e~^^2^^ and &(0) = id 

||[7(t)-id|| = sup|e~^'2^'-l| =2 

is exactly 2 and U{t) cannot be uniformly continuous in t. However, if i/i G L^(R^) is a 
wave function, the estimate 

\\u{t)i-^f= dk\e-''\^"-lf\%i^{k)f 

<7^ \ dfc|viwi'=4||viir 

shows we can invoke the Theorem of Dominated Convergence to conclude is strongly 
continuous in t G R. 

Definition 4.3.2 (Strongly continuous one-parameter unitary group) A family of uni- 
tary operators { t/(t)}teR on a Hilbert space is called a strongly continuous one-parameter 
unitary group - or unitary group for short - if 

(i) t ^ U(t) is strongly continuous and 

(ii) U(t)U(t') = U{t + tO as well as U{0) = id^ 
hold for all t, t' G R^. 
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This is again a group representation ofR^ just as in the case of the classical flow <l>. The 
form of the Schrodinger equation, 



i--Mt) = Hxljitl 
at 

also suggests that strong continuity/differentiability is the correct notion. Let us once 
more consider the free hamiltonian H = — on L^(R^). We have shown in the tutorials 
that its domain is 

In Chapter [sj we will learn that ^(H) is mapped by the Fourier transform onto 

Dominated Convergence can once more be used to make the following claims rigorous: 
for any G Qi(H), we have 



lim \r-{Uiit)-id)4-lPip\\< lim M-(&(t)-id)i/; + ^Fi/i 

V /-> lit Z 11^ Vr\ lit II I I Z 



(4.2) 



The second term is finite since ^p e and we have to focus on the first term. On the 
level of functions, 



lim i(e 



1) 



i — e 
dt 



-itifc2 



t=o 



holds pointwise. Furthermore, by the mean value theorem, for any finite t 
|t| < 1, for instance, then there exists < tg < t such that 



with 



This can be bounded uniformly in t by -P. Thus, also the first term can be bounded 



by uniformly By Dominated Convergence, we can interchange the limit t- 



>0 

and integration with respect to k on the left-hand side of equation ( |4.2D . But then the 
integrand is zero and thus the domain where the free evolution group is differentiable 
coincides with the domain of the Fourier transformed hamiltonian. 



lim 

t — ^0 



\{U[t)-id)^-lP^ 
This suggests to use the following definition: 



= 0. 



2009.11.18 
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Definition 4.3.3 (Generator of a unitary group) A densely defined linear operator on a 
Hilbert space with domain ^{H) £ is called generator of a unitary evolution group 
U(tl t G if 

(i) the domain coincides with 

¥(H) = \^ip^^\t^ U{t)^ differentiahle^ = ^(H) 

(ii) and for all ip G Qj{H\ the Schrodinger equation holds, 

i^U{t)^=HU{t)^p, 

This is only one of the two impHcations: usually we are given a hamiltonian H and we 
would like to know under which circumstances this operator generates a unitary evolution 
group. We will answer this question conclusively in the next section with Stone's Theorem. 



Theorem 4.3.4 Let H be the generator of a strongly continuous evolution group U^t), t G R. 
Then the following holds: 

(i) ^{H) is invariant under the action of U{t\ i. e. U{t)^{H) = Q)(H)for all t G R. 

(ii) H commutes with [/(t), i. e. [Lr(t),H]'0 := U{t)Hii) - H U{t)ii) = 0/or allteR and 
ip G ^(H). 

(Hi) H is symmetric, L e. {H(^,xp) = {lp,HxIj) holds for all ^yip ^ Q){H). 

(iv) U(t) is uniquely determined by H. 

(v) H is uniquely determined by t/(t). 

Proof (i) Let ip G ^(H). To show that U^t)!^ is still in the domain, we have to show 
that the norm of H[/(t)'0 is finite. Since H is the generator of U(t), it is equal to 



H^ = i — U{s)%p 
as 



s=0 ^ ^ 

Let us start with s > and omit the limit. Then 



= lim ^([/(s)-id)'0. 

Vq S 



([/(s)-id)[/(t)'0 = U{ty-{U{s)-id)^ = ^ ([/(s) - id)'0 



< 00 



holds for all s > 0. Taking the limit on left and right-hand side yields that we can 
estimate the norm of HU[t)xlj by the norm of Hxjj - which is finite since xp is in the 
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domain. This means [/(t)^(H) c ^(H). To show the converse, we repeat the proof 
for [/(-t) = [/(t)-^ = [/(t)* to obtain 

^(H) = [/(-t)[/(t)^(H) c [/(t)^(H). 

Hence, [/(t)^(H) = ^(H). 

(ii) This follows from an extension of the proof of (i): since the domain ^(H) coincides 
with the set of vectors on which [/(t) is strongly differentiable and is left invariant 
by U(t), taking limits on left- and right-hand side of 

\{u(s) - id)[/(t)v^ - uity-{uis) - id)^ = 

leads to [H,U(t)]ilj = 0. 

(iii) This follows from differentiating {U(t)ip,U(t)'ilj) for arbitrary if,xlj G ^(H) and 
using [Uit),H] = as well as the unitarity of [/(t) for all t G R. 

(iv) Assume that both unitary evolution groups, U(t) and U(t), have H as their genera- 
tor. For any G ^(H), we can calculate the time derivative of ||(t/(t) — [/(t))!/^! , 

^ll([/(t) - m)^' = 2^(||^ir - (^(ov^^ ^(f)^)) 

= -2Re {{-iHU{t)ip, lJ{.t)^) + {U{t)ip, -iHij{t)ip)) 
= 0. 

Since U{0) = id = U(0), this means [/(t) and [/(t) agree at least on ^(H). Using 
the fact that there is only bounded extension of a bounded operator to all of J^, 
Theorem 4.1.6| we conclude they must be equal on all of J^. 

(v) This follows from the definition of the generator and the density of the domain. □ 

Now that we have collected a few facts on unitary evolution groups, one could think that 
symmetric operators generate evolution groups, but this is false! The standard example 
to showcase this fact is the group of translations on L^([0, 1]). Since we would like r(t) 
to conserve "mass" - or more accurately, probability, we define for if G L^([0, 1]) and 
0< t < 1 



(r(t)(/p)(x) - 



if(x-t) X-tG[0, 1] 

(fix-t + 1) X-t + lG [0,1] 
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For all other t e we extend this operator periodically, i. e. we plug in the fractional 
part of t. Clearly, {T{t)ip,T{t)^) = holds for all ip,ilj e L^([0,1]). Locally the 

infinitesimal generator is —id^ as a simple calculation shows: 



d 

= i-7-^(x-t) 



t=o 



However, r(t) does not respect the maximal domain of — i^^, 

^max(-i^J = L\[0, 1]) I - id.if e L\[0, 1])}. 

Any element of the maximal domain has a continuous representative, but if (/?(0) ^ 
then for t > 0, T(t)ip will have a discontinuity at t. We will denote the operator —id^ 
on ^max(~^^jc) ^ith Pmax- Let US chcck whether P^^x is symmetric: for any G 
^max(-i^x). we compute 

{Lp,-id,xl^)=( dx(/p*(x)(-i^,i/;)(x)= [-i(/p*(x)V^(x)]'- f dx(-i)^.(/^*(x)V^(x) 
Jo Jo 



= i(<^*(o)i/>(o)-<^*(i)^(i))+ dxi-id^ipTixmx) 

Jo 

= i((/p*(0)i/;(0) - V^(l)) + i-id^if,^). (4.3) 

In general, the boundary terms do not disappear and the maximal domain is "too large" 
for -id^ to be symmetric. Thus it is not at all surprising, r(t) does not leave ^max(~^^x) 
invariant. Let us try another domain: one way to make the boundary terms disappear is 
to choose 



:={(/; G L\[0, 1]) \-id,ife L\[0, 1]), (/p(O) = = (/^(l)}. 



We denote —id^ on this "minimal" domain with Pmm- li^ this case, the boundary terms in 
equation ( |4.3D vanish which tells us that P^^^ is symmetric. Alas, the domain is still not 
invariant under translations r(t), even though Pj^m is symmetric. This is an example of a 
symmetric operator which does not generate a unitary group. 

There is another thing we have missed so far: the translations allow for an additional 
phase factor, i. e. for i/^ € L^([0, 1]) and i^ G [0, 2n), we define for < t < 1 

fr fr\n^f.^ .- / - ^) ^ - f ^ 1] 

W)^m '- _ f + 1) ^ _ t + 1 e [0, 1] • 

while for all other t, we plug in the fractional part of t. The additional phase factor 
cancels in the inner product, {T^{t)^ ,T^{t)'il)) = {^,"4^) still holds true for all e 
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L^il^A])' In general r^(t) 7^ T^'it) if 1^ ^ 1^^ and the unitary groups are genuinely 
different. Repeating the simple calculation from before, we see that the local generator 
still is —id^ and it would seem we can generate a family of unitary evolutions from a 
single generator. The confusion is resolved if we focus on invariant domains: choosing 
'd^ G [0,2n), we define to be the operator —id^ on the domain 

%i-id,) :={^e l2([o, 1]) I - id.if e l\[0, 1]), (^(0) = e-'*(^(l)}. 

A quick look at equation ( |4.3| ) reassures us that is symmetric and a quick calculation 
shows it is also invariant under the action of r^(t). Hence, P^ is the generator of T^, and 
the definition of an unbounded operator is incomplete vi^ithout spelling out its domain. 

Example Another example where the domain is crucial in the properties is the wave 
equation on [0,1], 

d^uix,t)-d^uix,t) = 0, UG ^2([0,L] x Rj. 

Here, u is the amplitude of the vibration, i. e. the lateral deflection. If we choose Dirichlet 
boundary conditions at both ends, i. e. u(0) = = u(L), we model a closed pipe, if we 
choose Dirichlet boundary conditions on one end, u(0) = 0, and von Neumann boundary 
conditions on the other, u^(L) = 0, we model a half-closed pipe. Choosing domains is a 
question of physics! 

4.4 Selfadjoint operators 

Although we do not have time to explore this very far, the crucial difference between Pj^m 
and P^ is that the former is only symmetric while the latter is also selfadjoint. We first 
recall the definition of the adjoint of a possibly unbounded operator: 

Definition 4.4.1 (Adjoint operator) Let Abe a densely defined linear operator on a Hilbert 
space ^ with domain ^(A). Let ^(A*) be the set of if ^ ^ for which there exists cj) e ^ 
with 

{Ailj,Lp) = {^,4)) Vi/^G^CA). 
For each if G ^(A*), we define A* if := <p and A* is called the adjoint of A 
Remark 4.4.2 By Riesz Lemma, (/? belongs to ^(A*) if and only if 

|(AV^,(/p)| <C||V^|| Vi/;G^(A). 

This is equivalent to saying if G ^(A*) if and only if i/; {Axjj, if) is continuous on ^(A). 
As a matter of fact, we could have used to latter to define the adjoint operator. 
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One word of caution: even if A is densely defined. A* need not be. 



Example Let/ 



e L°°(R), but / ^ L^(R), and pick i/^q e L^(R). Define 



^(r^):={i/.GL2(R)| dx|/(x)V^(x)|<oo} 



Then the adjoint of the operator 



has domain ^(rp = {0}. Let i/; G ^(T^). Then for any (/? e ^(rp 



Hence T^if = {xjjQ, (/?)/. However / ^ L^(R) and thus (/? = is the only possible choice 
for which T^ip is well defined. 

Symmetric operators, however, are special: since {Aip,xlj) = {ip,Ailj) holds by definition 
for all e J^, the domain of A* is contained in that of A, ^(A*) 2 ^(A). In particular, 
^(A*) c is also dense. Thus, A* is an extension of A. 

Definition 4.4.3 (Selfadjoint operator) Let H be a symmetric operator on a Hilbert space 
^ with domain ^(H). H is called selfadjoint, H* = H, if ^(H*) = ^(H). 

One word regarding notation: if we write A* =A, we do not just imply that the "operator 
prescription" of A and A* is the same, but that the domains of both coincide. 

Example In this sense, ^ P^i^- 

The central theorem of this section is Stone's Theorem: 

Theorem 4.4.4 (Stone) To every strongly continuous one-parameter unitary group U on a 
Hilbert space there exists a selfadjoint operator H = which generates U(t) = e~^^^. 
Conversely every selfadjoint operator H generates the unitary evolution group U(t) = e~^^^. 



2009.11.24 



A complete proof IIRSSli Chapter VIII. 3] is beyond our capabilities. 
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Schwartz functions are a space of test functions, i. e. a space of "very nicely behaved 
functions." The dual of this space of test functions, the tempered distributions, allow us 
to extend common operations such as Fourier transforms and derivatives to objects which 
may not even be functions. 

5.1 Schwartz functions 

The motivation to define Schwartz functions on comes from dealing with Fourier trans- 
forms: our class of test functions y(R^) has three defining properties: 

(i) y(R^) forms a vector space. 

(ii) Stability under derivation, d^yiR"^) c ^(R^): for all multiindices a G and 
/ G 5^(R^), we have d^f G y(R^). 

(iii) Stability under Fourier transform, 5^5^ (R^^) c 5^(R^): for all / G 5^(R^), the Fourier 



is also a test function. 
These relatively simple requirements have surprisingly rich implications: 

(i) 5^(R^) c lHm^), i. e. any / G 5^(R^) and all of its derivatives are integrable. 

(ii) S:y(R^) — >5^(R^) acts bijectively: if / G 5^(R^) c L^M^), then e y(R^) a 

(iii) For all a e N^, we have 5^((i3^)"/) = x";?/ G 5^(R^). This holds as all derivatives 
are integrable. 

(iv) Hence, for all a, a G Nq, we have x^'d^f G 



transform 




(5.1) 
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(v) Translations of Schwartz functions are again Schwartz functions, /(• — Xq) G 
this follows from dfi' - Xq) = e~^^-^o ^ for all Xq e R^. 

This leads to the following definition: 

Definition 5.1.1 (Schwartz functions) The space of Schwartz functions 
y(R') := {/ G ^-(R^) I Va, a G : ||/ < 00} 

15 defined in terms of the family ofseminorms^ indexed by a.aG Nq 

||/||„„:=sup|x«5;/(x)|, /€<^~(M<^). 

The family of seminorms defines a so-called Frechet topology: put in simple terms, to make 
sure that sequences in 5^(R^) converge to rapidly decreasing smooth functions, we need 
to control all derivatives as well as the decay. This is also the reason why there is no 
norm on 5^(R^) which generates the same topology as the family of seminorms. However, 
Ik La ~ ^ a, a G Nq ensures f = 0, all seminorms put together can distinguish 

points. 

Example Two simple examples of Schwartz functions are 

/(x) = e-^^', a>0, 

and 



\x\<l 
|x|>l 



The second one even has compact support. 

The first major fact we will establish is completeness. 

Theorem 5.1.2 The space of Schwartz functions endowed with 

S\\nn 



d(/,g):=Xi2"" sup - 

n=C) \a\+\a\=n 1 



15 a complete metric space. 



seminorm has all properties of a norm except that 11/ 11 = does not necessarily imply / = 0. 
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Proof d is positive and symmetric. It also satisfies the triangle inequality as x 
concave on Rq and all of the seminorms satisfy the triangle inequality. Hence, (5^(R^), d) 
is a metric space. 

To show completeness, take a Cauchy sequence (/„) with respect to d. By definition and 
positivity this means (/„) is also a Cauchy sequence with respect to all of the seminorms 
\\-\\aa' Each of the {x^S^fn) converge to some g^^t the space of bounded continuous 
functions ^"^(R^) with sup norm is complete. It remains to show that g^^^ = x^^x^oo- 
Clearly, only taking derivatives is problematic: we will prove this for |a| = 1, the general 
result follows from a simple induction. Assume we are interested in the sequence {dxjn)^ 
k G {1, . . . , d}. With a^. := (0, . . . , 0, 1, 0, . . .) as the multiindex that has a 1 in the kth entry 
and := (0, . . . , 0, 1, 0, . . .) G R*^ as the fcth canonical base vector, we know that 

fnM = fni^ - ^fc^fc) + d5 dxjn (x + (s - X^)e^) 

Jo 

as well as 

gooM = gooix - Xfcgfc) + ds dx^goo(x + (s - Xjjej,) 
Jo 

hold since — >gQQ and d^^fn — ^goa^, uniformly. Hence, goo ^ ^^(R^) and the derivative 
of goo coincides with goc^^, ^x^^oo — goa^,- then proceed by induction to show goo ^ 
^°°(R^). This means d(/„,goo) — ^0 as n — ^oo and 5^(R^) is complete. □ 

The norm of each element in 5^(R^) can be dominated by two seminorms: 

Lemma 5.1.3 Let f G 5^(R^). Then for each 1 < p < oo, the norm of f can be domi- 
nated by a finite number of seminorms, 



where Ci(d),C2id) G R+ and n(d) G No only depend on the dimension of R^. Hence, 
/GL^(R^). 

Proof We split the integral on R^ into an integral over the unit ball centered at the origin 
and its complement: let B„ := max|a|^2n ||/ ||ao' ^^^^ 

Vp 



VjRd J ^J\X\<1 y ^J\X\>1 y 

<Vol(B,(0))V^ ||/||„„+B„ 



59 



5 Schwartz functions and tempered distributions 



If we choose n large enough, |x| is integrable and can be computed expHcitly, and we 
get 



This concludes the proof. □ 
Lemma 5.1.4 The smooth functions with compact support S^J^CR^) are dense in 5^(R^). 
Proof Take any / eyiR"^) and take 

^ \ |x|>l 
Then /„ := g(/n)f converges to / in S^(R'^), i. e. 

iim l|/"-/|L = 



n TOO 



holds for all a, a G Nq . □ 

Next, we will show that ^ : 5^(R^) — > 5^(R^) is a continuous and bijective map from 
5^(R'^) onto itself. 



Theorem 5.1.5 The Fourier transform ^ as defined by equation ( |5.1D maps 5^(R^) contin- 
uously and bijectively onto itself The inverse is continuous as well Furthermore, for all 
/ G 5^(R^) and a,ae N^, we have 

S(x^(+i^JV) = r^?/. (5.2) 

Proof We need to prove S^(x^(+i^^)V) = first: since x^'d^f is integrable, 

its Fourier transform exists and is continuous by Dominated Convergence. For any a, a G 
Nq, we have 
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In the step marked with we have used Dominated Convergence to interchange integra- 
tion and differentiation. Now we integrate partially \a\ times and use that the boundary 
terms vanish, 

(i5(x''(+i3J«/))(0=-r^(+'%r f dx(+i3,re--f/(x) 

To show ^ is continuous, we need to estimate the seminorms of by those of / : for any 
a,aGNQ, it holds 



1 II II 



5.1.3 



In particular, this implies 5/ G 5^(R^). Since d^x^f G y(R^), we can apply Lemma 
and estaimte the right-hand side by a finite number of seminorms of / . Hence, ^ is 
continuous: if /„ is a Cauchy sequence in 5^(R^) that converges to /, then has to 
converge to ^/ G 5^(R^). 

To show that 3^ is a bijection with continuous inverse, we note that it suffices to prove 
d~^Sf = f for functions / in a dense subset, namely "^^(R^) (see Lemma 5.1.4). Pick / 
so that the support of is contained in a cube W„ with sides of length 2n. We can write / 
on as a uniformly convergent Fourier series, 

fix)= 2 /(?)e- ?, 

with 

If . , i27if' If 

VoKWj J^^ i2ny [2nfl^ 

Hence, / can be expressed as 
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which is a Riemann sum that converges to 



asSf This concludes the proof. □ 

Hence, we have shown that y(R^) has the defining properties that suggested its moti- 
vation in the first place. The Schwartz functions also have other nice properties whose 
2009.11.25 proofs are left as an exercise. 

Proposition 5.1.6 The Schwartz functions have the following properties: 

(i) With pointwise multiplication • : y(R^) x y(R^) — > 5^(R^), the space of y(R^) 
forms a Frechet algebra (i. e. the multiplication is continuous in both arguments). 

(ii) For all a, a, the map f ^ x^d^f is continuous on 

(Hi) For any Xq g R^, the map t^^ : f /(• — Xq) continuous on y(R^). 

(iv) For any f G 5^, ['^hej,/ ~/) converges to d^J as h — ^-O where e^. is the kth canonical 
base vector o/R^. 

The next important fact will be mentioned without proof: 

Theorem 5.1.7 ^^(R'^) is dense in L^(R^X 1 < p < oo. 

This means, we can approximate any L^(R^) function by a test function. We will use this 
and the next theorem to extend the Fourier transform to L^(R^). 

Theorem 5.1.8 For all f,ge ^^CR"^), we have 

dxm)MgM=[ dx/(x)(ygXx). 

This implies {^fyg) = {fyd~^g) and (Sfydg) = (fyg) ^here (•,•) is the usual scalar product 
onL^CR^). 

Proof Using Fubini's theorem, we conclude we can first integrate with respect to E, instead 
of X, 



62 



5.1 Schwartz functions 



To prove the second part, we remark that compared to the scalar product on L^(R^), we 
are missing a complex conjugation of the first function. Furthermore, (dfT = d~^f* 
holds. From this, it follows that {df,g) = {f,d~^g) and upon replacing g with ^g, that 

{df,dg) = {f,d-'dg) = {f,g). □ 

This has a nice consequence: since 5^(R^) c L^(R^) is dense and the Fourier transform 
acts unitarily on 5^ (R^) w ith respect to the scalar product, i. e. the Fourier transform is 
bounded. Theorem 4.1.6 allows us to extend ^ : 5^(R^) — > y(R^) to a unitary operator 
^ : L^CR"^) — > L^(r)7Hence, we have just proven 

Proposition 5.1.9 The Fourier transform ^ : 5^(R^) — > 5^(R^) extends to a unitary oper- 
ator ^ : L^iM"^) — >l2(r^). 

Now we will apply this to the free Schrodinger operator H = — ^A^. First of all, we 
conclude from Theorem |5 . 1 . 7| that the domain of H, 

5^(R^) c ^(H) = {ipe L^CR^) I - A^if G L^CR^)} c L^CR^), 

is dense. So let us pick initial conditions such that ipQ G 5^(R^) c L^(R^). Then we can 
Fourier transform the free Schrodinger equation. 



If we compute the right-hand side at t = 0, by Theorem 5.1.5 this leads to 

^(-iA,i/>o)(?)=|?'(5^oX?)- 

Denoting the Fourier transform oi xp hy %[>, this means the free Schrodinger equation in 
the momentum representation reads 

d 



We have already proven in section 4.3 that the unitary time evolution generated by H = 
is e~^^2^ . Sandwiching the unitary time evolution in the momentum representation 
in between Fourier transforms (which are themselves unitary on L^(R^)) yields the time 
evolution in the position representation. 

Proposition 5.1.10 Let i/^o ^ ^(M"^) c L^CR"^). Then for t ^ the global solution of the 
free Schrodinger equation with initial condition il^^is given by 

^(x,t)= : I dye^^i/;o(y)=: | dy p(x,y, t)i/;o(y). (5.3) 
[2nit) /2 J^, 

This expression converges in the norm to i/^q ^ — 



63 



5 Schwartz functions and tempered distributions 



• 1 f 2 

Proof We denote the Fourier transform of e~^^2^ by 

If t = 0, then the bijectivity of the Fourier transform on y(R^), Theorem 
[/(0) = id^. 



5.1.5 



yields 



So let t ^ 0. If i/^o is a Schwartz functio n, so is e~^^2^ i/^q. As the Fourier transform is 



a unitary map on L^(R^) (Theorem 5.1.9) and maps Schwartz functions onto Schwartz 



functions (Theorem 5.1.5), U(t) also maps Schwartz functions onto Schwartz functions. 
Plugging in the definition of the Fourier transform, for any i/^o ^ 5^(M^) and t 7^ we can 
write out [/(t)i/^o 



■(£-y]i 

dye' 2t 



.g-i|K-('->-VO' 



dye-'^«V.o(y) 

^o(y)- (5.4) 



We need to regularize the integral: if we write the right-hand side of the above as 



r. h. s. = lim 



1 

lim 



\ aye 2t 



dy 2t 



(271)^' 



\o (271)'^/^ Jw l(27iy/^ 



d^e 



we can use Fubini to change the order of integration. The inner integral can be computed 
by interpreting it as an integral in the complex plane. 



(271)' 



d^e 



1 



((^+00 



d/2 ' 



Plugged back into equation ( |5.4D and combined with the Theorem of Monotone Conver- 
gence, this yields equation ( |5.3| ). □ 



Later on, we will need the convolution, 

(/*gXx):= dy/(x-y)g(y), 

for the Moyal product the product which emulates the operator product on the level of 
functions. The convolution is an abelian product on 5^(R^), i. e. / * g = g */ is satisfied 
for all /,g e5^(R^). 
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Proposition 5.1.11 S^(M.'^) * 5^(K'^) c 

Proof Let /, g e S^(M.'^l As d(f*g) = {^njl'^f ^g, 

(5(/*g))(?)=7r^ f dxe--« f dyf(x-y)giy) 

(27i:) Jr.! Jr.! 

we see that / * g = (27i)'/^r ' (i?/ i?g) e ^^CK"*). □ 

5.2 Tempered distributions 

Tempered distributions are linear functionals on S^(R'^). 

Definition 5.2.1 (Tempered distributions) The tempered distributions 5^'(R'^) are the con- 
tinuous linear functions on the Schwartz functions 5^(E''). If L G S^'(M.'^) a linear func- 
tional, we will often write 

{L,f):=L(f) ^fe^iR'l 

Example (i) The 5 distribution defined via 

5(/):=/(0) 

is a linear continuous functional on 5^(R^). (See exercise sheet 6.) 
(ii) Let g G L^(R^), 1 < p < 00, then for / G 5^(R^), we define 

L,(n= I dxg(x)/(x)=:(gj). (5.5) 

As / G 5^(R^) c L^(R^), ^ + ^ = 1, by Holder's inequality, we have 

lU'/)l^lkllp \\fl- 

Since ||/ 1|^ can be bounded by a finite linear combination of Frechet seminorms of 
/, L„ is continuous. 
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Equation ( |5.5| ) is the canonical way to interpret less nice functions as distributions: we 
identify a suitable function g : — > C with the distribution Lg. For instance, poly- 
nomially bounded smooth functions (think of h(x,^) = + V(x)) define continuous 
linear functionals in this manner since for any h G "^^^(R^), there exists n e Nq such that 

I n 

V 1 + /i(x) is bounded. Hence, for any / G 5^(R^), Holder's inequality yields 



\ih,f)\ 



dxhix)fix) 



Later on, we will see that this point of view, interpreting "not so nice" functions as distribu- 
tions, helps us extend operations from test functions to much broader classes of functions. 
Similar to the case of normed spaces, we see that continuity implies "boundedness." 

Proposition 5.2.2 A linear functional L : 5^(R^) — > C is a tempered distribution if and 
only if there exist constants C > and /c, n G Nq such that 



|a|<fc 
\a\<n 



for aZZ/ G5^(R^). 



As mentioned before, we can interpret suitable functions g as tempered distributions. In 
particular, every Schwartz function g G 5^(R^) defines a tempered distribution so that 

(5,^g,/)=f dxd,^gix)fix) = -l dxg(x)d,Jix)=ig,-d,J) 



holds for any / G yiW). We can use the right-hand side to define derivatives of distribu- 
tions: 

Definition 5.2.3 (Weak derivative) For a G Nq and L G y^W^), we define the weak or 
distributional derivative of L as 

:= (L,(-l)l«l5;/), V/ e ^(m"). 

Example (i) The weak derivative of 5 is 

(a,^5,/) = (5,-a,/) = -5,/(o). 
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(ii) Let g G S^p^^(R^). Then the weak derivative coincides with the usual derivative, by 
partial integration, we get 



= + dx3,^g(x)/(x). 

Simil arly, th e Fourier transform can be extended to a bijection 5^^'^'^) — ' 5^X'^'^)' The- 2009.12.01 
orem 



5.1.8 



tells us that if g,/ e 5^(R^), then 

idg,f) = ig,df) 

holds. If we replace g with an arbitrary tempered distribution, the right-hand side again 
serves as definition of the left-hand side: 

Definition 5.2.4 (Fourier transform on c9^'(R^)) For any tempered distribution L e 5^XW^\ 
we define its Fourier transform to be 

idLJ) :=iL,df) V/G^(R^). 

Example (i) The Fourier transform of 5 is the constant function (2n)~'^^^, 

(55,/) = (5,5/) = 5/(0) = f dx/(x) 

(271) Jjj. 

(ii) The Fourier transform of makes sense as a tempered distribution on R: is a 
polynomially bounded function and thus defines a tempered distribution via equa- 
tion ( [531 ): 

i^x\f) = (x^;?/) = J^dxx^^ J^d?e-^-^/(?) 

d? f dx(+i)'^|e---^/(0 

M JR 



= (-l)^-(-l) I dd^l dxe--f)3|/(0 



= - I d?(2;ry/^5(05|/(0 

JR 

= -5|/(0) = ((27i)V^5,-5|/) = (-(2;r)'/^5",/) 
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This is consistent with what we have shown earlier in Theorem 5.1.5| namely 

dix'f) = i+id^m = -d^df. 

We have just computed Fourier transforms of functions that do not have Fourier transforms 
in the usual sense. We can apply the idea we have used to define the derivative and 
Fourier transform on 5^^(R^) to other operators initially defined on 5^(R^). Before we do 
that though, we need to introduce the appropriate notion of continuity on 5^X^^)' 

Definition 5.2.5 (Weak-^f^ convergence) Let 5^ be a metric space with dual A sequence 
(L„) in 5^' is said to converge to L ^ 5^' in the weak-^ sense if 

holds for all f e^. We will write w*-lim„ y^L^^ = L. 

This notion of convergence implies a notion of continuity and is crucial for the next theo- 
rem. 

Theorem 5.2.6 Let A : 5^(R'^) — > 5^(R'^) be a linear continuous map. Then for all L e 

5^'(R^l the map A' : 5^'(R^) — > 5^'(R^) 

(A'LJ) := {L,Af), f e ^(M^), (5.6) 

defines a weak-^ continuous linear map. 



Put in the terms of Chapter 4.2 A^ is the adjoint of A. 

Proof First of all. A' is linear and well-defined, A'L maps f ^ y(R^) onto C. To show 
continuity, let (L„) be a sequence of tempered distributions which converges in the weak-* 
sense to L G y'iR"^)- Then 

(A^L,,/) = (L„A/) (L,A/) = (A^L,/) 

holds for all / G 5^(R^) and A^ is weak-* continuous. □ 

As a last consequence, we can extend the convolution from * : 5^(R^) x 5^(R^) — > y(R^) 
to 

* : 5^'(R^) X 5^(R^) — > 5^'(R^) 

* : 5^(R^) X 5^'(R^) — > 5^'(R^). 
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For any f,g,h^ 5^(M.^), we can push the convolution from one argument of the duahty 
bracket to the other, 



if*g,h) = {g*f,h)=\ dy(,f*gXyMy)=\ dy dx/ (x)g(y - x)/i(y) 
dxfix)igi--)*hXx)={f,gi--)*h). 
Thus, we define 

Definition 5.2.7 (Convolution on 5^'(R^)) Let L g 5^'(R^) and g g 5^(R^). Then the 
convolution of L and g is defined as 

(L*g,/) :=(L,g(--)*/) V/e5-(K<'). 



By Theorem 5.2.6| this extension of the convolution is weak-* continuous. 
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After much preparation, we are finally in a position to formulate the tenets of Weyl cal- 
culus. If all we have to quantize are functions of positions or momentum alone, life is 
simple: the kinetic energy function T(^) ~ 2m^^ quantized via the Fourier trans- 



form: according to Theorem |5.1. 5 [ for any if e 5^(R^) c L^(R^), we have 



and we could define 

The reason why this works is that different components of momenta commute. However, 
not all functions are of this form, e. g. if one adds a magnetic field, we need to quantize 
h(x, ?) — ^ (? + In other words, we need a quantization procedure that 

encapsulates the commutation relations 

i[Q,,Qj=0 i[P,,p.]=o i[Pi,Qj]=s5ij 

of the building block operators position Q, 

(Q(^)(x) := X (fix), if G 5^(R^) c L^CR^), 

and momentum P 

(PifXx) := -ieV^ifix), if e 5^(R^) c L^CR^). 

We have added once more the semiclassical parameter e, although for much of what 
we do, this parameter is optional. Before we reformulate the commutation relations 
in a different way, we need some notation: in the hamiltonian framework, the motion 
of a classical particle is on phase space R^ x R^ =: S. x,y,2 G R^ are the position 
variables associated to the momenta ^,r],^ G R^. We will often use the capital letters 
X = (x, (^), Y = (y, 17), Z = (z,Q eE to denote coordinates in phase space. 
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6-1 The Weyl system 

Instead of working with Q and P, each of which defines a selfadjoint operator on its 
maximal domain 

we work with the corresponding evolution groups, namely translations in momentum 
and position 

[/(y) := e-^'-P, ([/(j)<^)(x) = v(x - ey), if e I^CK'*). 

These are not one-parameter groups, but strongly continuous d -parameter groups: one can 
check that 

UMUiy) = Uix + j), UixT = Ui-x), 

hold as all the different components of position and momentum commute. But what about 
combinations of U and V? Clearly the origin of the fact 

^ e-'^-(^(x - ey) = (y(r,my)ip)(x) 

can be traced back to the noncommutativity of the generators of U and V. Instead, we 
have just shown 

Uiy)Vir]) = e^'^y^ V{r])U{y) Vy G R^, G R^. 

It turns out that picking an operator ordering can be rephrased into picking an ordering 
of U and V . Since we would like real-valued functions to be quantized to potentially 
selfadjoint operators, we choose the "symmetrically" defined 

Definition 6.1.1 (Weyl system) For all X G we define 

W{X) := e-^(?-Q-^-P) =: g-icT((x,a(Q,P)) 

where cr(X, Y) := ^ • y — x • iq. 
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We note that the small parameter e is contained in the definition of the operators Q and 
P. The next Lemma tells us how this operator acts on wave functions: 

Lemma 6.1.2 For allY and if G L^(R^l (W(7)(/7)(x) = e"^'^'^''+23^V(^ + ^j) holds. 
The map Y ^ W(Y) is strongly continuous. 

Proof We use the Trotter product formula niRSSli Theorem VIIL31]) to write W(Y) as 
s- lim re-i^-^e+^^-^y^ 

= s- lim ('e-^''Qe+^^Pe-^''Qe+^^P---e-^''Qe+^^f'e-^''<5e+^^P) 
= s- lim ('e-^''Qe+^^Pe-^''Qe-^^Pe+'t^Pe-^''Qe-'i^f'e+'t^P--- 

n ^oo ^ 

The terms can be simplified using 

= e-^^-^^+^b) (e-'b-P^j) (x + e^y) 
where fc e {0, 1, . . . , n — 1}. This means the above expression can be rewritten as 

= s- lim e-';''-2fc=o(Q+^b)e+'^P. 



n TOO 



The sum in the exponential is a Riemann sum that converges to 

-E(^ + ^n3^)^^ ^ ds(x+sFy)=x + fy 

" fc=o Jo 

so that we get 

The strong continuity of 7 follows from the strong continuity of y L^(y) and 
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Remark 6.1.3 Another way to see this is via the Baker-Campbell-Hausdorff formula: for- 
mally, we have 

However, it is a bit tricky to rigorously control commutators of unbounded operators 
2009.12.08 IIRSSli Chapter VIII.5]. 

The Weyl system combines translations in real and reciprocal space. If one of the two 
components is 0, then W reduces to U or V, 

Wix,0) = Ui-x) = Uixf , 
The Weyl system completely encodes the commutation relations: 

Proposition 6.1.4 Let Y,Z gE. Then, the Weyl system obeys the following composition law: 

This contains the commutation relations: if we combine a translation along e^^ in real 
space and ej in reciprocal space, 

W(efc,0)W(0,ep = e'^f^^^ Wie^,ej), 
then the extra factor reduces to the exponential of half of [Qj^, Pj] = —ie5j^j. 
Proof This follows from direct calculation: for all if e L^(R^), we have 
(W(7)W(Z)(/;)(x) = e-^^^+i^>^ (W(Z)(/?)(x + sy) 

= gif (-y<+^-r))g-i(x+f (y+.)Kr,+0^ + ^(^y + 

= e^l^^^'^^ (W(7 + Z)(/p)(x). 
Hence, the claim follows. □ 

6.2 Weyl quantization 

To define the Weyl quantization, we first introduce a convenient variant of the Fourier 
transform, the symplectic Fourier transform on S 
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has the nice property of being an involution, i. e. = id^. If we pretend for a moment 
that Q and P are variables and not operators, we have 

= ^^| dxj d7e-(WQ,P))/(y) = J dy5(y-(Q,P))/(n 

Even though the above does not make sense if Q and P are operators, it gives an intuition 
why we define Weyl quantization the way we do: 

Definition 6.2.1 (Weyl quantization) Let f e Then the Weyl quantization of f is 

defined as 



\^Zn ) 



Lemma 6.1.2 essentially already tells us how Op(f) acts on wave functions: 

Proposition 6.2.2 Let f e 5^(E) and if e L^(W^). Then Dp(/) defines a bounded operator 
on L^(R^) and its action on Lp is given by 



(Dp(/)<^)W=— — dy dr,e-'(^-->V(K^ + y),f^)</'(y) (6.1) 



X 

Proof Let if e 5^(R^) c L^(R^). Then we can bound Dp(/)(/? by 

= (271)-' ijscx/ILi^s) Ikll- 



The L^-norm of ^^rf is certainly bounded as ^^f is a Schwartz function and thus in- 
tegrable. Hence, Dp(/) defines a bounded operator on 5^(R^) c L^(R^). Since the 



75 



6 Weyl calculus 



Schwartz functions are dense in L^(R^) by Lemma 5.1.7 we invoke Theorem 4.1.6 which 
ensures the existence of a unique extension of Dp(/) to all of L^(R^) . 

Equation ( |6.lD follows from direct computation and Lemma 6.1.2 
we have 



for any if G L^(r ), 



[Opif)if)ix) = 



1 



1 



(271) 



2d 



(271) 



2d 



dY \ dZe^^^^'^V(^)(^(^)^)U) 



dy dr] \ dz \ d^e 



^ii7]-z-yO 



-i7]ix+'-y) 



dy dr] \ dz \ d^e 



Jy^iz-lix+y))^-iiy-xH 



'/(^.a^(y) 



If we integrate over 17, we get a 5 that can be used to kill the integral with respect to z. 

dy I dz I da2ny5{z-Ux + y})e-'^y-''^<- 



(Dp(/)(/>)W = 



(.271) 



,2d 



= 7^1 d?e-H3^->^/(i(x + y),0v^(y) 

(271^)^ Jm^ J^d 



Identifying the integral with respect to C as partial Fourier transform, we get the second 
line of equation ( |6.1D . A variable substitution y^ := ^y yields the first line. □ 



Example Although we cannot justify this example rigorously yet, we will only do a quick 
sanity check whether the quantization of h(x, ^) = + gives the expected result. 
By linearity, we can consider each of the terms in turn: we ha ve a lready computed the 
Fourier transform of in the distributional sense in Chapter 5.2 for all ip G 5^(R^) 



(271)"/^ 

1 1 



dy — (;?(eV))(y-^)^(y) 



2m (271)"/^ 

holds. We see that this calculation hinges on ip being a test function. The other term 
can also be calculated using 

^g+ix t, ^ (271)'' 5^ where 5^(/) := f{x) is the shifted Dirac 
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distribution, 



(Dp(y)(/p)(x) = --^ dy{:Sl){y-x)V{\{x + y))ip{y) 

= 7:7^ \ dy(2;ry^^5(y-x)y(|(x + y))^(y) = V(x)^(x). 

Hence, the quantization of h yields 

Opih) = ^i-ieV,? + Vix) = ^P' + V(Q), 

exactly what we have expected. We see that it was crucial for this argument to work that 
G and we have to work to extend the integral formula to other if G L^(R^) \ 

y(M.^). If V is bounded , for i nstance, then Op(y) = V(Q) defines a bounded operator 
and we can use Theorem 4.1.6 to extend it to all of L^(R^). P^, however, is an unbounded 
operator and thus does not extend to a bounded operator on all of L^(R^) 



In Proposition 6.2.2 we have introduced the notion of operator kernel: the operator 
kernel is a function or distribution that tells us how the operator acts on wave functions. 
It can be useful in calculating things, e. g. if one wants to compute a trace. 

Every bounded operator and many unbounded ones have a distributional operator ker- 
nel. In general, it is not a function, but only distribution on x . The operator kernel 
of -id^^ is +i(27i)'/'^^^5(x - y) since for all if g 5^(R^) c L^CR^) 

i-id,^^Xx)={(+i)d,5,,^) ^ __L_ J^dyi^^^^^ 

= [3nii+ii2nf'd,5ix-y))ipyx) 

holds. Similarly, the operator kernel associated to T = \'^){^ \ = {^,' ) ij), ^ 5^(R^) 
is 



1 

(271)"/^ 

1 

■ (271)"/^ 



dy I Az{2nf'^\z)4>{z)5{x-y)^l,{y) 
dz{{2nri' \ Ay^*{z)5{x-ymy)\4>{z) 



dLzKj{x,z) 4>{,z). 
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Hence, even this well-behaved operator has a distributional operator kernel! 2009.12.09 

To get back to the properties of quantization procedures: Weyl quantization is linear, 
i. e. 

Dp(/ + ag) = Dp(/) + aDp(g) 

holds for all f,g^ ^(S) and a G C. Furthermore, we can compute the quantization of 
the constant function 1 to be 



= ^ix)= (idi2(/?)(x). 

In the tutorials, we will show that Weyl quantization intertwines complex conjugation and 
taking adjoints. 

This fact has the important consequence that real-valued functions are potentially mapped 
onto selfadjoint operators. 

Theorem 6.2.3 Weyl quantization is linear, maps the constant function 1 to id^2 and in- 
tertwines complex conjugation with taking adjoints in the sense of operators, L e. £)p(/*) = 
Dp(/)* holds for all f g 5^(S). 



6.3 The Wigner transform 

One natural question is whether we can rewrite the quantum expectation value (if, Op(f )'0 ) 
as a phase space average of / with respect to some measure lJi^p^^, 

{ip,Op(f)iP) = -^ I dXlidJXX){^,W(X)^) (6.2) 



1 



(2ny 



<ixfix)d,iW,wi-)xp))i-x), 



suggests to look at the symplectic Fourier transform of the expectation value of the Weyl 
system. Let us start with the first building block: 
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Definition 6.3.1 (Fourier-Wigner transform) Let e 5^(R^). Then we define the 
Fourier-Wigner transform p(il^,Lp) to be 

pixlj,ifXX):=i2Kr'^^if,WiX)xlj) (6.3) 

Lemma 6.3.2 Let ilj,ip e 5^(R^). Then it holds 

p(V^,(/p)(X) = (27r)-^/^((^,W(X)i/;) = --^ f dye-^-^(/p*(y-|x)V^(y + |x). 

Proof By formal manipulation, we get 

p(^, ifXX) = {2nr'l^ {if, = J ^ dy ^*(y) (W(X)^)(y) 



1 



^ f dyv'*(y)e-'«^+§-)^(y + ex) 

dye-'^-?<^*(y-fx)V'(y + fx). 



(271)^ 

Since xjj and are Schwartz functions, they are also square integrable and the right-hand 
side exists for all X G S. □ 

To write the quantum expectation value as a phase space average, we still have to push 
over the Fourier transform. 

Definition 6.3.3 (Wigner transform) Let ^ 5^(R'^). The Wigner transform Wiip, if) 
is defined as the symplectic Fourier transform of p{il),ip). 

Remark 6.3.4 There is a reason why we need to use W(—X) and not W(+X): the sym- 
plectic Fourier transform is unitary on L^(S) and 

holds. The extra sign stems from the fact that we are missing complex conjugation in 
integral ( |6.2p . 



Wixlj, ifXX) = I dy e-'y^ (/p* (x - |y) i/. (x + f y) . 



Lemma 6.3.5 The Wigner transform W(xp, with respect to xp,if G S^(R'^) is given by 

1 

(271)"/^ 
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Proof We plug in the definition of the symplectic Fourier transform and obtain 

= 7^J I dze-'(f-™)e--''<^*(z-|y)./,(z + fy) 

= 77^ C dy [ dz [ dr,e«-^>''e-'^-5<^*(z-|y)V'(z+|y) 

(27T) Jr^ Jgd Jr. 

= 7^f dye-^^«v*(x-|y)V(x + |y). 

The right-hand side is also well-defined as (x - |-) i/^ (x + |-) G L^M"^) for any x G R"^ 
and hence its Fourier transform exists. □ 

Example Take d = 1, e = 1 and consider (/?(x) = xe~^, for instance, the first excited 
state of the harmonic oscillator. Using 



2009.12.15 we calculate the Wigner transform to be 



^(</',</')(x,?) = -^ I dye-'3"f</,*(x-f)^(x + f) 

= 2e-T-L f dye-'^-2«(x2-y2)e-T 

= 2(x2 + (20'-l) e-2^'e-2«' ^ 0. 
Corrolary 6.3.6 For all ip,(fi e 5^(R'^) and f e 5^(E), we have 

(<^,Dp(/)V') = J <iXf{X)W{^P,ifXX). 

The Wigner transform has some very interesting properties: 
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Theorem 6.3.7 (Properties of the Wigner transform) Let ^ 5^{M.^\ x g and 



(i) is a real-valued function, but not necessarily positive. 

(ii) The marginals of the Wigner transform ofif,!^^ 5^(M.^) with respect to x and E, are 



1 



(iv) \\n^,i')\l2^s^ = s-'''y\l 



(v) Let R be the reflection operator defined by [R(fXx) := (f(-x). Then W(Rip,Rip)(X) = 
Wixp, If X-X) holds. 

(vi) W[^*,^*)ix,0 = '^(f,^Xx,-0 

(vii) WiU(y)xP, U(y)if)(,x, = -^(xp, - y, ^)for all y e 

Proof (i) We have to show "^(^^p, = "^(f, f): the complex conjugate of W^f, is 



Previously on page 80 we have already given an example where is not 

positive. 

(ii) If we take the marginals with respect to x, then up to a factor of {27if^^ that is due 
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to the choice of convention in Definition 1 6. 3. H we get 



dxWi^P,ipXx,^)=-^\ dx\ dye-'y^^ix-lyy^ix + ly) 



2- 



dx' \ dye-'y^ipix')*iPix' + sy) 



X 

1 

X X 

The other marginal can be obtained analogously. 

(iii) This follows immediately from (ii) since ^ and ip are square integrable. 

(iv) We plug in the definition of the Wigner transform and compute 

lk(^'<^)|&(s)= f f A^\w{^p,^){x,Qf 
1 



(271 f 



dx dy dy'e+'>-fe-'>'-«- 

; Jm^ Jm^ Jm^ 

• ^*(x - |y) ^(x + |y) ^(x - §/) ^*(x + §/) 
dx dy dy' { d?e+'(^-^'>« 



dx I dy(^*(x-fy)i/'(x+|y)(^(x-fy)ii;*(x + fy). 



After two changes of variables, this simphfles to 



L2(Md) II ^ ||L2(^d) • 
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(v) This is a direct consequence of the definition, 



WiRi,,Rifi)ix,0 = yz^, f dye->f(R<^r(x-|y)(R^)(x+|y) 

dye-^«V*(-x + |y)V'(-x-fy) 



1 



d/ e-'3'' C-« ((-X) - f y') V ((-X) + f /) 



(vi) Follows directly from the definition of the Wigner transform. 

(vii) Follows directly from the definition of the Wigner transform. □ 

The Wigner transform is also the inverse of the Weyl quantization: first, we extend the 
definition to ^^CR"^ x R^). 

Definition 6.3.8 (Wigner transform on 5^(R^ x R^)) We can extend the Wigner trans- 
form to functions K e 5^(W^ xW^) by setting 

WKix,^):=--^( dje-^-^lC(x+|j,x-|j). (6.4) 

Lemma 6.3.9 W is an isomorphism between 5^(R^ x R^) and ^5^(5). 

Proof W consists of a linear, bijective change of variables and a partial Fourier transform. 
Both leave 5^(R'^ x M^) invariant (mathematically, R"^ x R^ and S = R^ x R^ are equiv- 
alent). The bijectivity follows from the bijectivity of the Fourier transform in the second 
argument. □ 

The Wigner transform is essentially the inverse of Weyl quantization: 

Proposition 6.3.10 Let T G Dp (5^(S)) c ^ (L^(R'^)) be an operator with operator kernel 
Kj. The map Dp~^ : Dp(5^(S)) — > defined by 

is the inverse to Weyl quantization, L e. we have T = Op{e^ WKj) for all T G Dp(5^(S)). 
Conversely if we take any f G 5^(S) with Weyl kernel 

^/ = 7^ f dr7e-'(^-->V(|(x + y),er,)=£-''(y2/)(i(x + y),^), 
then Dp-^(Dp(/)) = s'^WKf =f holds. 
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2009.12.16 

Remark 6.3.11 The Weyl kernel map K : 5^(E) — > S^iR"^ x R^) which maps / G 5^(E) 
onto 

Kf[x,y) := e-^ {^2f){\{x + y), ^) 

is bijective as it consists of a bijective linear change of variables and a partial Fourier 
transform. 

Proof Let Kj- be the operator kernel associated to the operator T g Dp(5^(S)). Then Kj 
has to be in 5^(R^ x R^): as T is an operator that has been obtained by Weyl quantization, 
there is a preimage fj G and its Weyl kernel Kf^ has to be in 5^(R^ x R^). Writing 

r(/?, (/? G 5^(R^), in two different ways, 

(271) 1^ J^d 



(271)^/^^, (271)' 



= 7^J dy(K,(x,y)-K^^(x,y))(/p(y), 

^^^^^ JR^ 



we conclude Kj and i^j^ have to agree almost everywhere. Without loss of generality, we 



take the continuous (and thus smooth) representative Kf e 5^{R'^ x M'*). 



We have to confirm that 



holds. Plugging in the definition and making a change of variables, we get 



dy dT] dze-'(>'-'>''e-'^-"''- 

• J^T (|U + J) + |Z, |(X + J) - f Z) </P(j) 

dy dr) dz'e-''e+'(^-^-"'>''- 

■KTiKx + y + z'), Kx + y- z')) i^(y). 
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Now we integrate over r] which yields a 5 distribution that can be used to kill one of the 
remaining integrals. 



(271)' 



d/2 



(271)' 

1 



(271)' 



dy d/5(x-y-/). 

'KT{lix + y + z'l lix + y- /)) (/p(y) 
I dyi^^(|(x + y + (x-y)),|(x + y-(x-y)))(/p(y) 

dyi^7^(x,y)(/?(y)= (r(/?)(x). 



On the other hand, let T = Dp(/) be the Weyl quantization of / G 5^(S). Then Dp~^ T =f 
follows from direct calculation: using 

Kf (x + |y, X - |y) = e"'' (^5,/) (1 (x + f y) + 1 - f , " 1^) " + 1^) ) 



we get 



= d^^f dr]e-^-?e+^^ V(x,ry) = /(x,a 

(27r)^ J^d J^d 



To show that Dp ^ maps Dp(5^(S)) onto 5^(S), we invoke Remark 6.3.11 and Lemma 6.3.9 



which state that the kernel map K: ^{B) — > 5^(R'^ xM!^), f ^Kf, and the Wigner trans- 
form W : 5^(R^ X R^) — > 5^(S) are bijective. Hence the composition of the kernel map 
K and the Wigner transform ^ is a bijection as well. In fact, 

WoK\5^{S) — >5^(S) 

is the identity map by the above calculation. □ 

Example Let us see if we can dequantize = Dp((^^) to (^^; for simplicity set d = 1, 
although the arguments carry over to arbitrary dimension. The operator kernel associated 
to is 



]C(x,y) = V27r(-iE)^5^^(x-y) 
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since for any G we have in the sense of distributions 



dyKiix,y)ipiy)=-= dy (-isY - y)ipiy) 



'2k Jre v2k , 



Formally, we can dequantize P^, 

Dp-\P') = siWKXx,0 = ^ f dye->fK(x + |y,x-|y) 

This is exactly what we have expected to get. 

Remark 6.3.12 In the physics Hterature, one often uses bra-ket notation to write the 
Wigner transform. Adopting our conventions regarding factors of 271 as well as signs and 
setting s = 1, the Wigner transform of the operator T can be written as 

wnx,0 = 7T^A dye-'yi{x + \\T\x-\). 



{In-) 



The term (x + ||r|x-|) is nothing but the operator kernel i<^7^(x + |,x - 1). Sometimes 
(x + ^|r|x — |)is expanded in terms of eigenfunctions (which usually cannot be the 
eigenfunctions of T as the spectrum need not be purely discrete!). 

So far we have made no attempts to generalize Weyl quantization and Wigner transform 
beyond Schwartz functions. We shall resist the temptation for now. Weyl quantization, 
for instance, could immediately defined for functions / : S — > C such that ^^f e L^(S). 
Similarly, as the Fourier transform extends to a unitary map on L^(R^), we could have 
extended the Wigner transform to a unitary map between L^(R^ x R^) and 



6.4 Weyl product 

The Weyl product [t emulates the operator product on the level of functions on phase 
space, i. e. it satisfies 
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for suitable f,g:E — > C. It can be derived from the composition law of the Weyl system 
(Proposition 6.1.4| ) . 

Theorem 6.4.1 For f,g g 5^(S), the distribution f^g which satisfies Dp (^f^g) = Opif)Dpig) 
is a Schwartz function given by 

(/ttgXX)= J drj dZe-(™)e'1<^('''^H5,/X7)(5,gXZ) (6.5) 

Before we can prove this statement, we need an auxiliary result: take two operators T 
and S whose operator kernels Kj and are in 5^(R^ x R^). Then the operator kernel of 
TS is given by 

iKrOKsXx,y):=--^ dzKrix,z)Ksiz,y). 

(271) J^d 

Lemma 6.4.2 For any Kj,Ks e 5^(R^ x R^), the product Kj oKs is also in y(R^ x R^l 
ie.o: y(R^ x R^) x 5^(R^ x R^) — > y(R^ x R^). 

Proof We need to estimate the seminorms of Kj-oK^: let a, a,b,l5 G Nq be multiindices 
and for simplicity define ^ : (x,y,z) ^ (271^^^ KT(x,z)Ksiz,y). Then ^ e 5^(R'^ x R^ x 
R^) is a Schwartz function in all three variables. First, we need to show we can exchange 
differentiation with respect to x and y and integration with respect to z,Le. that for fixed 
X and y 

= 7r^f dzx-y''dp^P{Kr(x,z)Ksiz,y))=( dzx''y^5;3/$(x,y,z) (6.6) 

holds. We will do this by estimating |x^y^^^"^^$(x,y,z)| uniformly in x and y by an 
integrable function G(z) and then invoking Dominated Convergence. For fixed x and 
y, we estimate the norm of x'^y^ d^d^(^(x,y,') from above by a finite number of 



seminorms of (^(x,y, •) with the help of Lemma 5.1.3 



dz |xV'3;5/*(x,y,z)| = ||x''y''5;5/*(x,y,-)||,,( 



< 



Q sup|x^y 



= Q \\x''y'd^^dPHx,y,-)\lo + C2 max||xV'5;5/<f(x,y,-)||,o- 

|c| — 2n 
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Now we interchange sup and integration with respect to z, 



sup dz\xy^d^d^^(ix,y,z)\< I dz sup \xy^d^d^^(ix,y,z)\ 



sup |xV'5;5/<f(x,y,0| 



which can be estimated from above by 

sup \xy''d^d^Hx,y,')\ , , <Ci sup sup|x^y^a;^/<^(x,y,^)| + 



+ C2 max sup sup|x^y^^^"^^<l>(x,y,2)| 
Ci ll^ll /,«nn + C'o max II <l> II , « „ < 00. 

^ II llactbpOO ^ ui_o„ll WaabBcO 



|cN2n" 

Here {IMLab^crla « b ^ c ^gn^ family of seminorms associated to 5^(R^ x x R^) 

which are defined by 

maabpcr ■■= sup |xVV5;3/3J*(x,y,z)|. 

jc,y,2;eM^ 

This means we have found an integrable function 

G(z):= sup |xV'3;5/*(x,y,z)| 

jc,yeM^ 

which dominates x^y^^^"^^^>(x,y,z) for all x,y e R^. Hence, exchanging differentiation 
and integration in equation ( |6.6D is possible and we can bound the 

5^(R^ X R^) by 

\\^tOKs\1,,p = sup \xy'd^^d^^iKjOKs)ix,y)\ 

jc,yeM^ 

< Q 

This means oKs ^ yiR"^ xR"^). 

Proof (Theorem 16.4.1 1 ) Using the definition of Dp, we get 



semmorm on 



uRnn + max <l> , « „ 
aabpOO ^ |c|=2n" llaaopcO 



< 00. 



(27:} 



.2d 



dY dZ(,^J){Y)(,dagXZ)W(Y)W(,Z) 
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We recognize the inner integral as {dcr(fis))i^) thus we add a Fourier transform to 
obtain the first of the two equivalent forms of the product formula: 

= \ '^^ '"^'''^^ e'l-(^'^-^) (dafXY) id^gXZ - 7) 

It remains to show that /jjg is a Schwartz function: from Remark 6.3.11[ we know that 
the Weyl kernels Kf and Kg of / and g are in 5^(R^ x R^). Hence, we can write the 
operator product of Dp(/) and £)p(g) in terms of the associated integral kernels, 

(Dp(/)Dp(g)(/p)(x)=-^ I dy I dz-^Kfix,z)K^iz,y)^iy) 
(271) Jr. Jg, (271) 

= 7^1 <iy(KfOK^)ix,y)^iy) = DpifU)- 



By Lemma 6.4.2 Kf oK^ e y(W x W). We use the inverse of Weyl quantization. Propo- 
sition |6.3.1Q| to conclude 

/flg = Op-i (Dp(/)Dp(g)) = oK^) e 5-(H) 

as W maps S^QK'^ x R'^) bijectively onto 5^(S). 

To show that the first form of the Weyl product, equation ( |6.5D , is equivalent to the 
second, we have to write out all Fourier tranforms and collect the exponentials properly 
to obtain 

(/flg)(X)= J d7 J dY'l dz| dZ'e'''(^'^+^)e'i<^('''^)e''^(^'^'^e'''(^'2')- 

■fiY')giZ') 

" (2^/ '^^l ^^'l '^^l dZ'e''^(^''"-^)e'''(^'^'-^-§^V(ngUO 

= J dyj dy'l dz'e'<^(^'^'-^)5(z'-x-|7)/(ng(20 
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Making one last change of variables, Z := X + ^Y, we get the second form of the Weyl 
product, 

This concludes the proof. □ 

Example Let us compute x^E, which we then Weyl quantize using problem 23 (i). Since 
neither x nor E, are Schwartz functions, we will dispense with mathematical rigor for now: 
in the distributional sense, we see from 

that we need to compute the symplectic Fourier transforms of the factors. As distributions, 
their Fourier transforms exist and for the first, we get 

Similarly we calculate the second Fourier transform to be (5^^%p(Z) = —i{2nY d^^5(Z). 
Plugged into the product formula, we obtain 

= -J dyj dZ5^,(e'''(^'^+^)e'i'^(^'^') 5(7)5f,5(Z) 

= +J dyj dZ5f,((i?,-i|C0e'"^'''''^^Vi-(^'^)) 5(7) 5(Z) 

= J dF J dZ - |C0 (x, + |yO - i|) e^-CMe'l'^t^'^)- 

•5(F)5(Z) 
= (?,-|0)(x, + f0)-4=x,?,-ei. 
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By problem 23 (i) and Dp(l) = 1(1^2, we can compute the Weyl quantization of this func- 
tion expHcitly, 

= Q-P-^diidi2 = P-Q. 
This is exactly what we have expected. 2009.12.22 
The Weyl product has the following useful properties which can be easily proven: 

Proposition 6.4.3 (Properties of Weyl product) Let G and a G C. Then the 
Weyl product has the following properties: 

(i) The Weyl product is bilinear, L e. (/ + ag)^h = /jj/i + a(gH/i) and /tl(g + cch) = 
fk + <fih) hold. 

(ii) The Weyl product is associative, i. e. (/ttg)tt/i = /tt(gtt/i). 

(Hi) If f = fix) and g = g{x) are functions of position, then the Weyl product reduces to 
the pointwise product, fig=fg. 

(iv) If f = /(<?) and g = g{^) are functions of momentum, then the Weyl product reduces 
to the pointwise product, fk=fg' 

The proofs are left as an exercise. 

6.5 Asymptotics 

The most important reason why physicists should care about Weyl calculus is that there is 
an asymptotic expansion of the product, i. e. we can write 

00 

n=0 

where all the terms (/tJg)(n) ^re known explicitly. In general, this expansion does not 
converge to any function, but instead for any iV G N, we can write 

fk = f] f "(/Bg)w + e^+'R^ = f] e"(/flg)w + ^ie^'+'l 

n=0 n=0 
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As e — >0, the remainder converges to at least as fast as e^'^^. Expansions that do 
not converge are the rule rather than the exception, e. g. when one sums up Feynman 
diagrams of different processes, they are often sorted by powers of some small coupling 
constant, a ~ ^/isv, for instance. One has no reason to believe that summing over all terms, 
one gets something finite! This is where optimal truncation comes into play: instead of 
summing all terms, one stops at some N(e) where the error is minimal. Before we state 
the main theorem of this section, we will introduce the Landau symbols ("little and big O 
notation") . 

Definition 6.5.1 (Landau symbols) Let f,g:R^ — ^ C. We say /(x) = ^(g(x)) as 
X — > Xq if and only if 

lim sup \ r < 00 

X >x, \g{x)\ 

or quivalently if there exist M > and 5 >0 such that |/(x)| < M |g(x)| /or all |x - Xo| < 
5. 

We say f(x) = o(g(x)) as x — >Xq if and only if 

fM 
lim ^=0. 

X g(x) 

Example We have x^ = ^(x) and x = ^(x) as x — ^0. However x / o(x), but x^ = o(x) 
as X — > since 

x^ 

lim — = lim x = 0. 

X >0 X X ^-O 

Let us now formulate the main theorem of this section: 

Theorem 6.5.2 Letf, g G and e <1. Then the Weyl product can be expanded asymp- 
totically in e, L e. for any iV G Nq we have 

N N 

fk = Xi^"(/fl^)(„) + e^'^'RN = Xi^"(/fl^)w + ^ie^'+'l (6.7) 

n=0 n=0 

All the terms of the expansion are Schwartz functions, (/ttg)(n) ^ S^i^for all n G Nq^, and 
are known explicitly 



<ifk\nm = (V, • - V, • v,)7(F)^(z) 



(6.8) 

Y=X=Z 
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The remainder G as given by equation ( |6.10| ) is also a rapidly decreasing function. 

The first two terms are given by 

+ (6.9) 

where {f,g}= Y!j=i ^xjS - ^x/ ^^jS) ^he Poisson bracket. 

Proof Step 1: Expanding the twister. Let N g Nq be arbitrary, but fixed. Then we can 
expand the twisting factor in they Weyl product, 

e'^c-.^) = j]s^^Y, (a(y, z))" + Uy, zi 



where 



n=0 



I ... N+1 _c^N /5fc7(7,Z) 



R^iY,Z) = —{i^^aiY,Z)y^' I dsH-sfe 



1 7"^+l 



Jo 



is the remainder of the Taylor expansion. If we plug this into the product formula, 
(/flgXX)= ^l^dy^dZe^'^™ {Zlo^"^l,ia[Y,Z)y+R,[Y,Z)} 

N 

=■■ Y,s"mg\n)iX) + s''+'R^iX), (6.10) 

n=0 

we can define the nth order terms and the remainder 
Step 2: Treating (/ttg)(n). First of all, we note that 

\a\+\b\=n 



is just a polynomial in y, r], z and ^. Since / G 5^(2), by Theorem 5.1.5 
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holds for all multiindices a, b e Nq and similarly for g. This means, the integral expression 
for (/tt^)(n) exists for all X G S and reduces to 



1 ^ (-1)1^1 1 r f 



= -- E ^^fd4dZe-C™. 

•(^a((-^5,0'(+i5y)V))Ci^)(^a((-^5c')''(+^5,0'g))U). 



The remaining integral is nothing but the symplectic Fourier transform in Y and Z. The 
symplectic Fourier transform is its own inverse, hence we get expression ( |6.8D , 



"•^ \a\+\b\=n 

= ;^^ Z ^((-'%)''(+^5.)7)m((-^%r(+^3j^g)(x) 

= ^^(Vy-v^-v,.v,)7(ng(2) 

Since each of the factors consists of derivatives of Schwartz functions, (/jjg)(n) ^ ^(S) is 
also a Schwartz function. 



Step 3: Treating the remainder Rpj, If we combine Theorem [6.4.1[ /ttg ^ ^(S) with 



Step 2, (/tJg)(„) € ^(S), we conclude that the remainder is also a Schwartz function, 

N 

RN=fk-J]'''(fk\n)^yim. 



n=0 



From the explicit expression for the remainder, equation ( |6.1Q| ), we also see that it is 
indeed of order ^(e^^^). This concludes the proof. □ 



Example In the previous section, we have calculated (^jjx. If we use the asymptotic ex- 
pansion - which in this case is exact, we can obtain this result with much less work. 
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Pluggin in equation ( |6.9| ), we get 

i ^ 
i ^ 
i ^ 

The remainder, however, is exactly 0: the factor (cr(7,Z))^ is a polynomial of second 
order in y • and r] -z and leads to two derivatives with respect to position and momentum. 
But X and ^ are linear so their second and higher-order derivatives vanish identically. 
Hence, we have shown 

E,^x = E,'X- ed'-. 

Generally if one of the factors is a polynomial of k\h degree, the asymptotic expansion 
terminates after finitely many terms and is exact. 

In Chapter [Z4l we have emphasized the importance that the the Moyal commutator 

[/.g]ti '=fh-gU 

vanishes as e — ^0. Using equation ( |6.9D , we immediately get 

The error is of third order as all contributions from even powers vanish. This can be traced 
back to 

(a(7,Z))''= = {-a{Z,Y)f'' = iaiZ,Y)Y' VfceNo 

and the commutativity of multiplication in C. We mention the latter explicitly as we will 
quantize matrix-valued functions in the next section. There, it is not at all clear that even 
the 0th order vanishes. 2010.01.12 

6.6 Application: Diagonalization of the Dirac equation 

We will now give an application that is not at all trivial. If it were 1983, we could have 
published a paper in a prestigious scientific journal [lCor83l |Cor04i] . The dynamics of 
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a relativistic spin-i/s particle is described by the Dirac equation ["Ynd96T . The form of 
this equation follows from three requirements: gauge-covariance with respect to the inho- 
mogeneous Lorentz group which is comprised of Lorentz boosts, rotations in space and 
translations in space-time (3 + 3 + 4 = 10-dimensional) and the fact we are interested in 
5pm-V2 particles with mass m> [Wig39 |Ler961 iBuiSSI] . As long as the energy is below 
the pair creation threshold of 2 mc^, the Dirac equation accurately describes the physics of 
a spin-V2 particle. Here, c is the speed of light in vacuo. 

We will not attempt to give a derivation here, but only present the result: the Dirac 
equation can be written in standard form as 



d 

where the free Dirac hamiltonian for a particle with mass m > 

Hj^ = c^ml3 + ci-ihV^) • a 
contains the aj and (3 matrices, 

a A , f idc2 \ V 1 9 Q 

The aj G Matc(2), j = 1,2,3, are the usual Pauli matrices. For convenience, we choose 
units such that h=l and rescale the energy by i/c^ 

Ho ^Hd = mP + (-^Vj . a =: Ho(P). 

This suggests to use 

c 

Q'=x 

as building block operators. Using the relations 

ajP = — Paj, 

•= + oijOik = 25|^^idc4 kj = 1,2,3, 

we can show that 

Uo(P) := , ^ = ((£(P) + m) idc4 - (P • a)p), 

is unitary, i. e. it satisfies 

"o(PK(Pr = idi2=Uo(Pruo(P), 
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and diagonalizes Ho(P), 

Uo(P)Ho(P)uo(P)* =£(P)/3 := y/m^ + P^p. 

This means, the relativistic kinetic energy governs the free dynamics of relativistic par- 
ticles. The upper two components of wave functions in the diagonalized representa- 
tion correspond to particles ("electrons"), the lower two are associated to antiparticles 
("positrons"). As each of the energies is two-fold spin-degenerate, the unitary Uo(P) is not 
unique. 

If we add an electric field, we need to add the electrostatic potential, 

H(Q, P) := + P . a + ^y(Q) =: Ho(P) + ^.H^m 

As long as the particle's energy is below the pair creation threshold, we expect that elec- 
tronic and positronic degrees of freedom still decouple, at least in an approximate sense. 
Our choice of building block operators suggests that the relevant small parameter is Vc 



More preoperly in light of the discussion in Chapter 2.4 the small parameter should be 
dimensionless. To remedy this, we should in principle work with the ratio where Vq is 
some characteristic velocity of the system. If the particle is relativistic Vq > 0.1c, but not 
too fast, say Vq ^ 0.7c, we expect approximate decoupling in the presence of an electric 
field. 

Since P and Q do not commute, their commutator is of the order ^/c, and Uq(P) no 
longer diagonalizes H(Q, P). The usual recipe found in text books is to use the Foldy- 
Wouthuysen transform IIFW50[ |Tha92[ |Ynd96|] which successively diagonalizes H(Q, P) 
up to errors of higher order. The result is the non-relativistic Pauli hamiltonian and higher- 
order corrections, 



and it is often claimed that one gets the Taylor expansion of ^ m^ -\- around ^ = 0. 
This is correct, but neither a proof nor a derivation. This equation also does not accurately 
describe the dynamics of a relativistic particle, the kinetic energy is non-relativistic. How 
do we recover y + P^ as kinetic energy? The first who has found a way was Cordes 
in the mid-1980s IICor83ll . More sophisticated arguments have since appeared elsewhere 
IITeu03[ IFL08II which allow for the derivation of corrections in a systematic fashion. 
Let us do the calculation using Weyl calculus. In fact, all we need at this stage is 

fk = fg-li{f,g} + ^(VcO = fg + 0m. 
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6 Weyl calculus 



Then we immediately conclude 

Uo(P)ff(Q, P)uo(P)* = EiP)P + ^Uo(P)V(Q)uo(Pr 

= £(P)/3 + j,Dpiuo)Dp(V)Dp(ul) = £(P)^ + ^Dp(uottVflu*) 
= £(P)^ + ^£)p(uoVu* + 0m) = E(,P)I3 + ^V(Q) + 0^1 

A slightly more sophisticated argument shows that if in addition to the electric field, a 
magnetic field is added, then the Weyl quantization of Uq(x, ^) := Uq[x, E, — ^A(x)) diag- 
onalizes 

ff^ := mP + iP- ^A(Q)) • a + ^ V(Q) 

up to errors of order i/c^ 

In ChapterlSj we will present a systematic scheme to derive all higher-order corrections. 
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7 The semiclassical limit 



This section is the culmination of what we have done up to now and one of the two 
main topics of this lecture: we aim to make the fuzzy notion of 'semiclassical limit' into 
a mathematical theorem. The semiclassical limit can be understood as a scaling limit. 
Since even the physics of semiclassical limits is somewhat obscure, we start the topic by 
repeating the commonly chosen approach, namely the Ehrenfest 'theorem.' 

In Chapter [Z2{ we have discussed the Heisenberg picture in quantum mechanics where 
observables A are evolved in time, A(t) = e'^^~^^Ae~^~^^ , and states remain constant in time. 
Then A(t) obeys the Heisenberg equations of motion, 

at IS IS 

If we consider the standard hamiltonian H = + ^(Q) for simplicity, then the equa- 
tions of motion for position Q and momentum P are 

AQ(t) = le+^^^ [Q,H] e-^'s^ = e+^^^^e^^^ = ^ (7.1) 

and 

^ Pit) = ^e+^t^ [P,H] e-^i^ = ^e+^i^ [P, y(Q)] e"^^^ 

= -(V,y(Q))(t) = -V,y (Q(t)), (7.2) 

respectively. The last equality, — (V^y(Q))(t) = — V^y(Q(t)), is non- trivial and a con- 
sequence of the so-called spectral theorem | IRS81[ Section VIII. 3]. These equations look 
like Hamilton's equations of motion, equation ( |2.2| ) . 

If the particle is in state i/; e ^(H) c L^(R'^), equations ( |7.1D and ( [712] ) make predic- 
tions on the measured velocities, momenta and so forth: we simply compute the expecta- 
tion value. The first line, 

m ■■= (0(0) = (^) = ^^), (7.3) 

reproduces the relationship between averaged velocity and momentum of a nonrelativistic 
classical particle. The equation of motion for the expectation value of the momentum 
reads 

Pit) :=E^(P(t)) =E^(-V,y(Q(t))) = (V^,-V,y(Q(t))i/.). (7.4) 
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4^ 




Figure 7.1: The green wavepacket is sharply peaked and thus will have a 'good' semi- 
classical behavior. The red wavepacket's spread is too large, its expectation 
values of position and momentum are not expected to follow the trajectory of 
a classical point particle. 



This result is known throughout the physics community as Ehrenfest 'theorem.' We have 
put the word theorem in quotation marks since a mathematically rigorous proof equires 
one to make assumptions on H. Although it is possible to make these equations mathe- 
matically rigorous (e. g. IIFSIOIH , we shall make no such attempt. 

Equation ( |7.4D presents us with the challenge that we do not have direct access to the 
position variable q(t) = (Q(t)) as in general 

E^(-V,V(Q(t))) / -V,y(E^(Q(t))) (7.5) 

holds. Hence, it is still not clear in what way this implies classical behavior. Even in some 
approximate sense, left- and right-hand side are in general not equal as one can see from 



the red wavepacket in Figure 7.1 This means that in order for a semiclassical approx- 
imation in the sense of Ehrenfest to hold, the wave function must be sharply peaked in 
comparison to the variation of the potential V, i. e. one heuristically expects the semiclas- 
sical approximation to be more accurate for the sharply peaked green wavepacket rather 
than the red one. If the wave function has multiple peaks or varies on the same scale 
as the potential, we cannot expect semiclassical behavior in this description. Indeed, this 
point was very clear to Ehrenfest when he first proposed this approach in a 1927 paper 
jEhfia: 



Gleichung (5) [equations ( |7.3D and ( |7.4D ] besagt aber offenbar: Jedesmal, 
wenn die Breite des (Wahrscheinlichkeits-)Wellenpakets i/^i/^* (im Verhaltnis 
zu makroskopischen Distanzen) ziemlich klein ist, passt die Beschleunigung 
(des Schwerpunktes [q]) des Wellenpaketes im Sinne der Newtonschen Gle- 
ichungen zu der "am Orte des Wellenpaketes herschenden" Kraft [— V^V]. 
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Figure 7.2: The incoming wavepacket (green) is split into two outgoing contributions 



However, in most standard textbooks, these crucial remarks are not properly stressed: 
first of all, the validity of a classical approximation has little to do with 'h being small/ 
but rather with a separation of length scales. Secondly the Ehrenfest 'theorem' can serve 
as a starting point of a semiclassical limit if one restricts oneself to wave packets which are 
sharply peaked in real and reciprocal space. The prototypical example are Gaussian wave 
packets and their generalizations.^ 

So assume we start with a wave packet which is well-localized in space and reciprocal 
space, e. g. a Gaussian. How long does it remain a sharply focussed wave packet? Even in 
the absence of a potential, we know that the wave function broadens over time and the 



maximum of |i/^(x)| scales as t ^^'^ over time (see Proposition ( |5.1.10D ). Furthermore, if a 

potential is present, it is not at all clear if the wave packet breaks up or not. 

Let us consider a step potential V{x) = Vq{\ — Q{x)), Vq > where Q is the Heavyside 
function. Then from quantum mechanics, we know that upon hitting the step, part of 
the wavefunction will be reflected while another part of it will be transmitted, i. e. the 
wavepacket splits in two! Hence, slow variation of the potential and "smoothness" are 
crucial.^ The microscopic scale is given by the de Broglie wavelength associated to the 
wavefunction ip, 



^Interest in states of this form has surged in the 1980s and there is a vast amount of literature on this topic. An 
early, but well- written review from the point of view of physics, is a work by Littlejohn [|Lit86ll . For a detailed 
mathematical analysis, a good starting point are the publications of Hagedorn, e. g. [ |Hag80| |. 

^The smoothness condition can be lifted to a certain degree: the potential only needs to be smooth in regions 
the wave function can 'explore.' 



(red). 




2nh 



X = 
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7 The semiclassical limit 



whereas the macroscopic scale is given by the characteristic length on which V varies. In 
what follows, let s denote the ratio of microscopic to macroscopic length scales. Thus e is 
a dimensionless quantity and <. 1. Then if y(x) varies on the scale ^(1), V(ex) varies 
on the scale ^(e). Hence, the force Fg(x) := —V^(y(ex)) = —sV^V(ex) generated by 
y(^x) is weak and the equations of motion are given by 

. P 

X = — 
m 

p = -sV^Viex). 

It is crucial to understand that these equations are written from the perspective of the 
quantum particle, i. e. we use microscopic units of length! If we rewrite these equations 
on the macroscopic length scale, we need to use q := as position coordinate and we 
obtain 

P 

q = ex = e — 
m 

p = -eVqViq). 

Hence, forces are weak and we need to wait long times to see their effect, i. e. macroscopic 
times t := s/s. On this time and length scale, the extra factors of e in the equations of 
motion cancel. 



d 


d 


= d;'^ 




d 


d 







= -^v,y(q). 



In other words, upon rescaling space and time to macroscopic units, we arrive at the usual 
equations of motion of a classical particle. Now that we have found the correct micro- 
scopic and macroscopic scales, we can write down the Schrodinger equation in micro- 
scopic units of length and long, macroscopic times. We will keep the physical constant 
h = 1.0546 Js to emphasize that none of the arguments hinge on 'h being small.' Now, the 
Schrodinger equation reads 

ishj-^xl^.it) = (-^^v,)' + y(^x))i/^,(t), V^,(0) = 11^0^ ^ifl\ 

and the building block operators are 

Qmicro •= (^-6) 
^micro * x' 

The index s mip^ indicates that the solutions to the Schrodinger equation depend para- 
metrically on e. Alternatively, we could rescale space and introduce macroscopic position 
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q := sx which implies = sV^. Then in macroscopic units, the Schrodinger equation 
reads 

iE/i|^i/;,(t) = (^(-iE/iv,)'+y(q))i/;,(a ^m=^o^^(hi (7.7) 

Since we live in a macroscopic world, this scaling is in fact by far the most common 
choice. What is the relation between ip^ and i/ig? |'i/;g(t,x)|^ is a probability density 
and thus scales like i/voiume = e~^. This suggests to define the unitary rescaling operator 

U, : L^(R^) — >L^(R^) as 

Indeed, the prefactor is consistent with the change of variables formula: for all if,xlj G 
L^(R^), we have 

{U,if,U,xlj)=\ dq{U,if)iqT{UMq)=\ dq ^"^ 

dx(/?*(x)'i/;(x)= 

In this scaling, the building block operators are 

Qmacro (7-8) 
^macro • ^^^x' 

Looking at the Schrodinger equation in the macroscopic scaling, equation ( |7.7| ), one un- 
derstands why the semiclassical limit is often thought of as the h — >0 limit. Formally, the 
roles of s and h are interchangeable, physically they are not! In fact, the precise origin of 
s says something about the physical mechanism why a system behaves almost classically. 
In the next Chapter, for instance, e := ^jm is given by the square root of the ratio of the 
masses of light to heavy particles and e — > implies the dynamics of the heavy particles 
freezes out. 

Put together, we are now prepared to tackle the main questions of this section: 

(i) What is the semiclassical limit? 

(ii) Is it necessary to start with special 'semiclassical' states? 

(iii) How large is the error? 

(iv) Do we make the semiclassical limit for observables or for states? 
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The first point can be understood by comparing the frameworks (Chapter 2.3 ) and using 
Weyl calculus as developed in Chapter |6| we already know how to associate quantum 
observables to classical observables (Weyl quantization) and quantum states to quasi- 
classical states (Wigner transform) . Hence, the task we are left with is a comparison of 
classical and quantum dynamics. In the Heisenberg picture where observables are evolved 
in time and states remain constant, we can thus compare the quantization of the classically- 
evolved observable 

Fd(t):=£)p(/(t))=Dp(/o<^J (7.9) 

and the quantum mechanically evolved observable 

Fqjt) '=e^''s^^^^^Opif)e-''s^^^^^ (7.10) 

Here is the hamiltonian flow associated to the hamiltonian function h : E — > R and 
equation ( |2.2| ) and e"^^^^^^-* is the strongly continuous one-parameter time evolution 
group generated by Dp(/i) over macroscopic times. Ideally we would like to study dy- 
namics generated by fairly general hamiltonian functions (and associated operators) as 
well as observables. Typically they are smooth, but have polynomially growth: the stan- 
dard hamiltonian h(x, ^) = ^^^ + V(x), position x, momentum E, and angular momentum 
L = X aE, are of this type. A theorem of this generality is out of reach with the tools we 
have at our disposal, so we restrict ourselves to the case h,f G 5^(S). The reader rest 
assured that these somewhat artificial restrictions can be lifted using so-called oscillatory 
integral techniques [|Rob87l |Rob98|] . The general strategy remains the same, but each 
step in the derivation needs to be justified carefully. 

Theorem 7.0.1 (Semiclassical limit for observables) Let h e 5^(E) be a classical hamil- 
tonian function and 0f the hamiltonian flow associated to h via equations ( |2.2| ). Then for 
any (real-valued) observable f G 5^(S), the classical evolution approximates the quantum 
evolution in the folloM^ing sense: for any T > 0, there exists Cj- > such that F^^(t) and 
Fci(t) agree up to errors of order 0{e^)for all t G [— T, +r]. 



<Cje^\t\. (7.11) 



Remark 7.0.2 Question (ii) and (iii) are answered decisively: at no point do we make 
assumptions on the specific shape or form of the initial state, the above estimate holds in the 
operator norm, i. e. uniformly for all initial states! Furthermore, we get an explicit upper 
bound for the error term in the proof. 

It seems that the Heisenberg picture is singled out here, but we will also prove a semi- 



classical limit for states which follows from Theorem 7.0.1 and Corollary 6.3.6 
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The main ingredients in the proof are the so-called Duhamel trick (essentially an appli- 
cation of the fundamental theorem of calculus) as well as the asymptotic expansion of 
the Weyl product. A footnote to the educated reader: we purposely mimic the proof for 
symbols whose quantization define possibly unbounded selfadjoint operators and ignore 
some minor simplifications. 

Proof Let T > be arbitrary but fixed. As h,f G 5^(E) are real-valued and have an 
integrable symplectic Fourier transform, £)p(h) and Op(/) define bounded selfadjoint op- 



erators on L^(R^) (Proposition 6.2.2 and Theorem 6.2.3 ) and we do not have to consider 
domain questions. By Stone's theorem, L/(t) = e~^^^^^^is a unitary strongly continuous 
one-parameter evolution group. Hence, F^^(t) defines a bounded operator on L^(R^). 

By assumption on h, the components of the hamiltonian vector field and all its deriva- 
tives in X and ^ decay rapidly and are smooth. Thus, by Corollary |2.1.2 and Theo- 
rem 2.1.3 the hamiltonian flow (p^ exists globally in time and is also smooth. By LRob87l 
Lemma in.9] all derivatives of the flow are bounded uniformly in t G [— T, +7] and the 
composition /(t) = / o 0^ is again a Schwartz function."^ Since products of derivatives 
of Schwartz functions are Schwartz functions, we also have ^/(O = {KfiO} ^ ^(S)- 
Hence, F^i(t) also defines a bounded selfadjoint operator for all t G R. 

We will now use the so-called Duhamel trick to rewrite the difference of F^^(t) and 
Fci(t) as the integral over the derivative of a function via the fundamental theorem of 
calculus: for any if G 5^(R'^) c L^(R'^), we have 

F^M^-FM^ = e+^^^^^^^Dp(/)e-^^^^^V -i)p(/(t)V^) 

= j dse+^^^^^^^ (^^Dp(/i)Dp(/(t-s)) + 

+ ^£)p(/(t -s)) - iDp(/i)Dp(/(t -s)) j e-^^^^^V. 

The above integral exists as a Bochner integral, i. e. an integral of vector-valued func- 
tions."^ We can group the first and last term to give a commutator; furthermore, the 
commutator of the Weyl quantizations of two Schwartz functions is equal to the Weyl 
quantization of the Moyal commutator and once we plug in the asymptotic expansion of 



■^The idea of the proof is to write down equations of motion for the first-order derivatives of (p^- which involve 
second-order derivatives of h. These are bounded by assumption and thus imply that all first-order derivatives 
of the flow (pf. are bounded by a constant depending only on time. One then proceeds by induction to show 
the boundedness of higher-order derivatives, 
measurable function / : R — > 3C that takes values in a Banach space 5C is called Bochner integrable (or 
integrable for short) if and only if ||/ H^^i^^ '= /j^dt < oo- 
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the Weyl product. Theorem |6. 5. 2[ we obtain 

i [Dp(/i),Dp(/(t -s))] = ^Dp([h,/(t -s)]„) = ^Opi-ie{h,fit-s)} + a[e^)) 
= Dp{{h,f(t-s)}) + 0(e^l 

Since the Fourier transform : — > ^(S) is a continuous map and all derivatives 
of the flow d^d^cj), are bounded functions, \a\ + |^| > 1, (Lemma IIL9 from [iRobSyB ) 
which can be bounded independently of t G [—T,-\-T], we conclude we can also inter- 
change integration implicit in Dp and differentiation with respect to s as /(t — s) and 
^/(t — s) can be bounded uniformly in s G [0, t] and for all t G [— +r] . Hence, we can 
write the difference between the quantum observable F^^(t) and the classical observable 
F,i(t) as 

Jo 

= ( d5e+'7^P('')Dp(i[h,/(t-5)]„ + |(/(t-5)))e-'7°''(''V. 
Jo 



By Proposition 2.1.7 



l/(t -s) = -^/(t -s) = -{/i,/(t -s)} 

holds. On the other hand, we can expand the Moyal commutator with the help of the 
asymptotic expansion of the Weyl product, Theorem |6.5.2[ 

[h,/(t -s)]„ = -is{h,f(t-s)} + 0(s'). 



The error is really of third order in s as explained at the end of Chapter 6.5 and known 
explicitly. Put together, this implies 

i [/i,/(t -s)], + ^(/(t -s)) = i(-ie){/i,/(t -s)} - {h,f[t-s)} + if^Ce^) 

where the right-hand side - as a sum of Schwartz functions - is again a Schwartz function. 
Thus, its quantization is a bounded operator on L^(R^) and the estimate 



|Dp(i[/i,/(t-5)]„ + A(/(t-5)))| 



< CtS^ 



holds for all t G [— T, +r] and some constant Cj- > that depends on T. Taking the 
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operator norm of F^^^t) — F^i(t),we conclude 



|Mt)<^-Fc,(t)L(,.(«.))< [ d5||Dp(i[/t,/(t-5)]„ + A(/(t-5)))| 

Jo " ' 



This finishes the proof. 



Example Even though the following example does not satisfy the assumption of the pre- 
ceding theorem, it illustrates the semiclassical limit for a realistic hamiltonian and ob- 
sersable. Consider h(x,^) = -\- V(x) for some smooth bounded potential V with 
bounded derivatives up to any order. Then the Zth component of momentum has a 
good semiclassical limit: we can (formally) compute 

^Pjit) = -lOpih), P/t)] = ie+'i^PC') [Op(ft), Pj] e-'j^"™ = -5. V(Q(t)) 
at e s ^ 



s 



and 



tj(t) = {h,^j(t)} = -d,y(x(t)) 



explicitly. Naively one may think that there is no error term in the Moyal commutator 
since is a polynomial of first order and thus the asymptotic expansion of the Weyl 
commutator terminates after the first-order correction. However, we plug in the time- 
evolved observable ^jit) which clearly depends on the initial conditions (xo,^o) ^ cho- 
sen. Hence, in general, the higher-order terms in the Moyal commutator [/i, <^^(t)]p = 
-ie{h, ^jit)} -f ^(e^) are non-zero. 
Repeating the calculations in the proof (and ignoring technical problems), we arrive at 



Jo " ' 

= [ ds\\Dp(^-d,yix{t-s)) + 0is^) + d^y[xit-s))\ 

Jo " ' 



4 

Jo 
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Since the derivatives of the potential up to third order (which are needed to control the 
remainder) are bounded, Cj- < oo may be chosen. Hence, at least formally, momentum 
has a good semiclassical limit. Similarly position also has a good semiclassical limit. 

The semiclassical limit for states is a consequence of the semiclassical limit for observables. 



Corrolary 7.0.3 (Semiclassical limit for states) Let h e 5^(E) and T > 0. For any ip G 

L^(R^), we define ip^t) := e~^^^^^^V- Then the classically evolved signed probability mea- 
sure 

Mci(t) := (271)-^/^ Wiilj, ^) o (7.12) 
approximates the quantum mechanically evolved state 



up to errors of 0{e )in the sense that 



'qm 

2 



IxUt) :={2nr'l'W{^l^{t\^{t)) (7.13) 



J dXfiX) ((/iqm(t))(^) - Mmx)) = a{e^) V/ G ^{S) (7.14) 

holds for any t e [-T,+T]. 

Note that this is a statement on the time evolution of states: the observable / only 'tests' 
the difference between the quantum state iUqm(t) = {Iny^^^ W{'il){t),'4){ty) and the clas- 
sically evolved state /ici(t) = (27r)~''/^ (j>-t' 



Proof By Definition of the Wigner transform and Corollary 6.3.6 for any / G 5^(S), we 
can rewrite 



{i>,P^^m) = (^(t),Dp(/)^(t)) = J dX^(T/>(t),^(t))/(X) 

as a classical expectation value with respect to the signed probability measure fi^^^it) = 
(271)"''/^ ^(i/;(t),i/^(t)). Similarly the expectation value of the classically evolved observ- 
able can be rewritten as 



(^,Fd(t)^) = (^,Dp(/(t))V') = ^^ I dX/o<^,(X)(^(V',V))(n 



An application of Liouville's theorem, Theorem |2.1 .91 yields 

(271)^ 



(^,Fd(t)^) = -^ I dX/(X)(^(^,^)o<^_J(X)= I dX/(X)(Md(t))(X) 



Hence, by Theorem 7.0.1 [Ji^^it) - iUci(t) = ^ie^) in the sense of equation ( |7.14D . 
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One last word on wavepackets: there are three general types of wavepackets IITeu03[ 
pp.59-60]: 

(i) Localization in space and reciprocal space: let ifj G L^(R^) satisfy some mild techni- 
cal assumptions (i/^ G 5^(R^) works, but more general -functions are admissible 
as well). Then the -function 

xjjl (x) :=^-^Ae-;?o-(^-^o)^(^) 

is sharply peaked at Xq for e small while its Fourier transform is peaked around 
^Q. Wave packets of this type track the trajectory associated to the initial conditions 
(xq, <^o) ^ energy Eq = h(xQ, ^q). Typical examples are Gaussian wavepackets 

yd/2 _ 

where i/^(x) = -^e 2 . 

(ii) Localization in space or reciprocal space: for any G L^(R^), 

is peaked around for ^ < 1 while its probability distribution in real space is given 
by \^l^ix)\ , i. e. 

in the weak sense. Similarly the classical pseudo-probability distribution associated 
to 

is |i/i(^)p5(x — Xq). In both cases, one deals with a distribution of initial states in 
either real or reciprocal space. 

(iii) WKB ansatz: here, we assume the wave function is of the form 

The ansatz here is that the wave function associated to the probability distribution 
p has fast oscillations superimposed to a slowly varying enveloping function . 
Then 

holds in the weak sense as has been computed in exercise 27. This means, we start 
with an initial distribution of states in real space, each of which follows the classical 
trajectory associated to the classical action S. 



109 



7 The semiclassical limit 



110 



8 Theory of multiscale systems 



There are plenty of examples of systems with two or more inherent scales: 



(i) A classical spinning top: the Earth, for instance, rotates around its own axis about 
once every 24 hours (to be more exact, it is approximately 23 hours, 56 minutes and 
4.1 seconds). On the other hand, the axis of rotation precesses around the axis of 
precession about once ever 25, 700 - 25, 800 years. 



(ii) A quantum spinning top is realized by systems with L • S coupling in strong magnetic 
fields. Here, and are "almost good quantum numbers" [iSak94i p. 311] if the 
magnetic field B is strong. 



(iii) The Dirac equation as treated in Chapter 6.6 IIFL08II . 



(iv) Currents in crystalline solids including a Kubo formula which explains the quantiza- 
tion of the Hall current IIPST03all . 



(v) Polarization and piezoelectricity of crystalline solids [|Lei05[ IPST06I] . 



(vi) Physics of molecules in the Born-Oppenheimer approximation IIPST07II . 



(vii) Time-adiabatic problems of quantum mechanics [|PST03b[ Chapter 4.3], [|Kat66l1 . 

Our treatment is based on the seminal work of Panati, Spohn and Teufel PPST03b[ |Teu03IL 
space-adiabatic perturbation theory and we will introduce it by a specific example: 



8.1 Physics of molecules: the Born-Oppenheimer 
approximation 

Consider a molecule in consisting of N nuclei with masses m„, nG{l,...,iV}, and K 
electrons. If we consider electrons and nuclei as non-relativist particles, then, quantum 
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mechanically, the dynamics is generated by the Born-Oppenheimer hamiltonian: 



The potential energy has three distinct contributions: an electron-electron interaction 
term, an electron-nucleon interaction term and a nucleon-nucleon interaction term. Here, 
x„ G R"^ and y^. G R-^ are the coordinates in R^^ of the nth nucleon and /cth electron, 
respectively. 

If, for instance, one assumes that the particles interact via the Coulomb potential and 
the nuclei have (effective) charges Z„, nG{l,...,iV}, then the potentials are given by 



K N ry 



It is often advantageous both, for practical and theoretical considerations, to smudge out 
the charge so as to get rid of the Coulomb singularities. Also, the bare Coulomb potential 
is often replaced by an effective nucleonic potential that takes into account the fact that 
electrons associated to lower shells do not play a role in chemical reactions. We will 
always assume that the potentials are smooth functions in x. 
The time evolution is generated by the Schrodinger equation. 



i/i--^(t) = Hbo^CO, ^(0) = ^0 ^ L^(R^^ X R^^). 

o t 

To simplify the discussion, we have neglected here that electrons are fermions and as such 
obey the Pauli exclusion principle, i. e. ^^us also ^(t) should be antisymmetric 

when exchanging two electron coordinates and y^. Before we continue our analysis, 
we will simplify the hamiltonian: first of all, we choose units such that h = 1 and by 
a suitable rescaling, we may think of the nuclei having the same mass Tn^^^. Then the 
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8.1 Physics of molecules: the Born-Oppenheimer approximation 



Born-Oppenheimer hamiltonian can be written as 

^ n=l ^ k=l 

+ Ve-eiy) + K-nuc(^. j) + V^uc-nucC^)- 



This suggests to introduce e := y^e/m^^^ < 1 as the small parameter and by choosing 
units such that = 1, we finally obtain 

=4P2+H,(Q). 

Physically s is always small since a proton is about 2,000 times heavier than an elec- 
tron. This means even in the worst case, e < V44. Typical atomic cores contain tens of 
hadrons, i. e. m^uc — 10^ — lO^mg, which means that very often the numerical value of 
e is significantly smaller than 1/44. Hence, we have introduced the usual building block 
observables 

P := -ieV^ (8.2) 
Q'=x 

and we can use the results from Chapter [6j 

The physical intuition tells us that electrons adjust instantaneously to the configuration 
of the nuclei, i. e. they are enslaved by the nuclei, and P and Q are approximate constants 
of motion, 

[Hbo.P] = ^(^) 
[Hbo.Q] = ^(^). 

The strategy to solve this from a physicist's perspective is the following: 2010.01.20 
(i) For fixed nucleonic positions, one solves the eigenvalue problem 

where the nucleonic positions x = (xi, . . . ,Xjq) G are regarded as parameters 
and (/?„(x) G J^fast '= L^(M^^) is an electronic wave function. More properly, one 
looks at the spectrum of H^^x) for each value of x which gives rise to energy bands 
(bound states where the molecule exists) and usually also a part with continuous 
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Figure 8.1: A simple example of a band spectrum. The relevant part of the spectrum is 
separated from the remainder by a gap [iSTOlll . 



spectrum (which means the molecule has dissociated) . Typically, the energy values 
E„ are sorted by size, i. e. 

Ei(x)<E2(x)<... 



(ii) For physical reactions, there are usually "relevant hands" e. g. the ground state and 
the first excited state. These two need to be separated by a gap from the rest of the 
spectrum so that one may neglect band transitions from the "relevant bands" to the 
rest. 

(iii) The last step is an "adiahatic approximation:'' one can neglect band transitions to 
other bands if the total energy is bounded independently of e, e. g. if 

\\P^^^\\ = \\eV,^^\\ = a{l). 

Then there are several options: 

(a) One treats the nucleons classically, e. g. in case of an isolated relevant band, one 
considers the hamilton function 

where the ground state energy band acts as an effective potential. For us, the 
semiclassical approximation is a distinct and separate step (see Chapter [83] ). 
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(b) Or one treats nucleons quantum mechanically. Usually one does a product 
ansatz here: let x ^ L^(M.^^) be a nucleonic wave function and ipi(x)^ L^(M?^) 
be the electronic ground state. Then we define the product wave function 

Related to this, we define the projection onto the electronic ground state 

noM := |(/?i(x))((/?i(x)|i2(^3i.3. 

For each x, tTqCx) is an operator on the electronic subspace L^(R^^) = ^fast- Its 
quantization ttqCQ) is an operator on the full Hilbert space L^(R^^ x R*^^) which 
maps each ^ G L^(R^^ x R^^) onto a product wave function, 

(7ro(Q)^)(x,j)= ((/?i(x),^(x,-))i2(j^3ic) ^iix,y). 

In order to compute how H^q acts on product wave functions 

'^ix,y) = xM(^^ix,y), 
we compute the expectation value: since 

holds, we can compute the expectation values separately for kinetic and poten- 
tial energy: the potential energy term can be inferred from 

(He(Q)z^i)(^, j) = xMH,Mif,(x,y) = E,MxM^iix,y) 
= iE,iQ)x^i)iix,y). 

The kinetic energy term splits in four: using the symmetry of P, we get 
(PM/,PM/)= f dx f dyiPx^iyix,y)iPx^iyx,y) 

J^3N J^3K 

= 1 dx f dy UPxyM^lix,y) + x%x){P^,yix,y))- 

J^3N J^3K ^ 

•((Pz)W^i(^,y) + zW(P^i)(^.y)). 

Now we rewrite this expression by introducing the scalar product on L^(R"^^), 
using that the fast state (/?i(x) is normalized for all x G R^^ , 
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and by introducing the Berry phase ||Ber84[ ISim83[ iBMK+OSll 

j2^{x) '=i{Lp^{x)y^Lp^{x))^2^^,Ky (8.3) 
This leads to the following simplifications if we complete the square: 

(PM,^PM,)= f dxr(P;^)*(x)(P;^)(x)+ 

- (P/)*(x) . ^^(x);^(x) - x%x)8j4\x) . (P/)(x)+ 
+ ^'/*W(V^(/Pi(x),.V^(/Pi(x))^2^^3^3;^(x)) 



= (;^,(P-^^(Q))';^),™+ 



/.(V,(/Pi(Q),(id-|(/Pi(Q))((/;i(Q)|)V,(/Pi(Q));^)^ 



This suggests to propose 

71o(Q)Hbo^o(Q) =\{^- ^^(Q))' + ^i(Q) + 70(Q) (8.4) 
as ejfective hamiltonian for the nuclei where the last term, 

0(x):= (v,(/Pi(x),.(id-|(/Pi(x))((/Pi(x)|)V,(/Pi(x))^^^^^^^ (8.5) 

is the so-called Born-Huang potential. 
What are the problems associated to this simplistic approach? 

(i) The nuclei move slowly, their typical velocities are of order ^(e), and we have to 
wait for times of order 0(^1 e) to see motion on the scale Hence, it is necessary 
to include all 0{e) corrections. 

(ii) Since [71o(Q),Hbo] — ^(^)^ ^^^i" times of order 0(^1 e), has the initial product wave 
function evolved into an approximate product wave function? If so, how big is the 
error after macroscopic times? In other words, does 

(e"^ '"^^^ ;^ (/^i ) (x, y ) = f (x) (/^i (x, y ) + ^(^) 

hold for some x ^ L^{^^^)1 Naively, the answer would be no: if we compare the full 
evolution e~^^^^o to that generated by 71o(Q)Hbo^o(Q)^ then a naive approximation 
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10 20 30 40 50 



Time (a.u.) 

Figure 8.2: The wave functions oscillates in and out of the relevant subspace [ ]Wya05[ 
p. 162]. 



yields 



ds 



-i(^-5)7ro(Q)HBo^o(Q) 



This is not good enough for our intents and purposes since the errors may sum up to 
order In reality the integral is highly oscillatory and we will show in Chap- 

ter 8.4.4 that the errors stay 0{e) small. The way one should think about it is 



this: the wave function oscillates quickly in and out of the space of product states 
71o(Q)l2(r3^ X r3J^) (see Figure [s^. 



(iii) Last, but not least, what about higher-order corrections? Are we able to improve our 
approximation by some scheme? 
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8.2 The adiabatic trinity 

Born-Oppenheimer systems have three basic features that are shared by many other mul- 
tiscale systems (e. g. the treatment of the relativistic spin-i/s particle presented in Sec- 
tion |6.6p . We will call them adiabatic trinity: 

(i) A distinction between slow and fast degrees of freedom. The nuclei move much 
slower than the electrons. This suggests to split the total, physical Hilbert space 
J^phys := L^(M^^ X R^^) into slow (nucleonic) and fast (electronic) Hilbert space, 

^slow ^ ^fast ^'(IR'^) ^ 

(ii) A small dimensionless parameter e <il which quantifies the separation of scales. In 
our situation, e := \l ^^/m^^^ quantifies the ratio of typical velocities of electrons and 
nuclei if they are assumed to have similar momentum. 

(iii) A relevant part of the spectrum ofH^^Q^x, ^) (an operator on ^f^st) separated by a gap 
from the remainder Here, we are interested in a collection of bands 

o-reii^,^) := [j{Ejix)} 

that are physically relevant, e. g. the ground state band. Here, ^ is a finite index 
set, L := < 00. There may be band crossings between relevant bands, but 
the relevant bands must not cross, merge or touch the remainder of the spectrum 
crjg^(x, ^) := a [H^^q^x, ^)) \ (J^-Qiix, In other words, there must be a gap 

q := inf dist (o-^eiCx, a^'^^Cx, ?)) > 0. 

This gap suppresses transitions from relevant bands to states in the remainder of the 
spectrum exponentially. Hence, we can consider the contributions from the relevant 
2010.01.26 part of the spectrum separately. 

Since this is a conceptual problem, we will use "conceptual notation," e. g. J^hys instead 
of L^(R^^ X R^^) and J^f^io^ instead of L^(R^^). We are initially given a hamiltonian 
operator 

CO 

ft, = H,(Q, P) := ^"^n(Q. P) '= Hbo 

n=0 

which acts on J^hys — L^(M^^ x R^^). We allow the more general situation where 
is a power series in s. The idea is that for ^ > 0, the nucleonic kinetic energy acts as a 
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"perturbation" for the case ^ = 0. If the nuclei are infinitely more heavy than the electrons, 
£: = 0, their dynamics freezes out and we can arrange all bits and pieces into the following 
diagram: 



n 



-ituoHe=0"o 



"0 



phys 



n 

^slow ^ ^fast 



no 



(8.6) 



-itnQHg^QfiQ 



^^slow®< 

g-ifnref«0^e=0"o^ref 



In the unperturbed case, the dynamics on the physical Hilbert space J^hys — L^(M^^ xR-^^) 
is generated by e~^^^'=^ = e~^^^e(Q)^ Associated to the family of physically relevant bands, 
property (iii) of the adiabatic trinity. 



we can defined the associated /amiZj of projections 



(8.7) 



Here, the ^j(x) are the eigenfunctions of the electronic hamiltonian He(x) associated to 

Fear each x G , H^^x) G ^(^,J^ast) is a selfadjoint operator with dense domain 
c c^fast, i- e. He is an operator-valued function. Without loss of generality, we may 
assume the (/?^(x), ; G ^, are normalized to 1 for all x G R^^, 

If a band Ej is fc-fold degenerate, the index set ^ is such that the band is counted k times, 
i. e. we associate k orthonormal vectors to this band which span the eigenspace associated 
to Ej{x). 

If we quantize jZq to Kq := 7ro(Q), it becomes an operator on J^phys- The subspace 



^O-^phys ^ -^phys 
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corresponds to those states where the electronic (fast) part belongs to the relevant sub- 
space. By assumption, we are interested in the dynamics of initial states that are associated 
to the relevant electronic bands. Since the unperturbed hamiltonian H^^q commutes with 
Uq, [Hg^Q, Uq] = 0, the operator Uq is a constant of motion and thus the dynamics gen- 
erated by the unperturbed hamiltonian He=o leave Ttgc^hys invariant. This means, the 
unitary time evolution on the relevant subspace ttq Jt^hys is generated by 

In our case, the projected hamiltonian is equal to 

where Einstein's summation convention is used (i. e. repeated indices in a product are 
implicitly summed over) . 

So far, we have only used property (iii) of the adiabatic trinity, the existence of physi- 
cally relevant bands. With this, we have explained the left column of the diagram. The 
right column is associated to property (i), i. e. the existence of slow (nucleonic) and fast 
(electronic) degrees of freedom. We now introduce a unitary Uq which facilitates the sep- 
aration into slow and fast degrees of freedom. While it is superfluous in the unperturbed 
case, it is essential once the perturbation is switched on. Let {Xj}jG/ ^ ^fast be a set of 
orthonormal vectors in the fast Hilbert space. These vectors are arbitrary and nothing 
of the subsequent construction will depend on a specific choice of these vectors. For all 
X G R^^ , we define the unitary operator 

Uo(x) := \Xj){^jM\+u^M : ^^st ^fast (8.8) 

where (x) is associated to the remainder of the spectrum and is such that Uq{x) is a uni- 
tary on .^asf "o(^) maps the x-dependent vector ^jix), j G ^, onto the x-independent 
vector Xj, 

UoM^jix) = Xj = Uo(xO(/?;(xO. 

The quantization Uq := Uq(Q) is a unitary map from Jifp^ys to Jt^siow ^ ^fast — ^^(M^^) ^ 
l^Q^SKy simple product state ^j, G L^(R^^), ; G /, is mapped onto 

(UO0^;)(X, j) = K(Q)0^;)(^. j) = 0(x)l/o(x)(/?/x, j) = (t>iix)Xjiy). 

In the new representation, product states are mapped onto even simpler product states 
where the electronic component is independent of x. Since Uq is a unitary operator, the 
full dynamics can be transported to the unitarily equivalent space, 

Uoe-''^^=% = e-^^"oH.=o"S . ^^^^^ ^^^^^ ^ ^^^^^ 
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This is not an approximation, just a convenient rewriting by choosing a suitable repre- 
sentation. The states associated to the relevant bands can also be written in the new 
representation: if we view ttq Jt^hys a subspace of ^hys? makes sense to write 

^O^O^phys — ^O^O^o ^O^phys = ^^O^O^o^^^slow ^ ^ast)- 

We dub this unitarily transformed projection reference projection fl^^f = id^^^^^ n^^f, 

Tlref := Uo(x)7ro(x)Uo(x)* 

= (IZ;)(^;WI+"oW)l^fcW)(^fcWI( 

= \Xj){Xj\' 

By construction, the reference projection does not depend on the slow (nucleonic) coor- 
dinates. We can and will view 

ftrefC^slow ^ ^fast) = ^slow ^ (^ref^fast) — ^slow ^ 

either as subspace of J^siow ^ ^fast more commonly as J^siow <S) cl^^l. In case of the 
latter, each Xj ^ ^fast is identified with a canonical base vector of cl^L Hence, operators 
on flrefC^siow ^ ^ast) — ^siow <S) cl'^l Can be thought of as matrix-valued operators just 
like the Dirac hamiltonian. The dynamics on Jt^iow d-^l is generated by 



slow ^ 



' ^slow ^ 



These dynamics describe the full unperturbed dynamics exactly if we start with initial 
conditions associated to the relevant bands. 

For the unperturbed case, i. e. £• = 0, we have found a way to systematically exploit the 
structure of adiabatic systems. What happens if we switch on the perturbation, i. e. we 
let ^ > 0? Let us postulate that if the separation of scales is big enough, i. e. ^ < 1 small 
enough, there exists an analogous diagram 



n 



phys 



Us 



^-itUsH.U* 

n 

-^slow ® -^fast 



(8.9) 



g-itn^eff/gHeUlnref 
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where some ^ -dependent unitary : J^hys — ^ ^siow ^ ^fast facilitates the splitting into 
slow and fast degrees of freedom and a projection : J^phys — ^ ^phys is somehow still 
associated to the relevant bands o-^^i^x) = [Jj^j{Ej(x)}. If the nucleonic kinetic energy 
can be regarded as a perturbation, it would be natural to ask that 

holds in some sense. Implicitly, we have postulated that ft^ and have the following 
four properties: 

n^ = n, [H„Y[,] = ^ien (8.10) 

The first column merely tells us that fig and are a projection and a unitary while the 
second tells us the two are adapted to the problem. Ultimately we are interested in the 
dynamics generated by the effective hamiltonian 

which generates the dynamics on the reference space J^siow ^ d^L It describes the dy- 
namics accurately if we start with an initial state in the subspace ft^J^hys- Compared to 
the naive subspace Ttg^phys. this subspace is tiled by an angle of order ^(s). 

Panati, Spohn and Teufel have constructed ftg and recursively order-by-order via a 
"defect construction." Before we go into detail, let us introduce Weyl calculus for operator- 
valued functions which is then used to reformulate equation ( |8.1QD . This allows us to 



make use of the asymptotic expansion of the Weyl product proven in Chapter 6.5 



8.3 Intermezzo: Weyl calculus for operator-valued functions 

We need to modify Weyl calculus to accommodate operator-valued functions. Let be a 
separable Hilbert space and ^(J^) the Banach space of bounded operators on J^. Our 
goal is to quantize functions / : S — > ^(J^) that take values in the bounded operators 
on J^. The simplest example which we have already gotten to know in Chapter [63| are 
matrix-valued symbols. For all intents and purposes, the reader may think of = C^. 
As usual, we start by defining the Weyl system: for each X G S 

:=e-^"^^'^Q'P^^0id^ 
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defines a unitary operator on L^CR^, J^) = l2(R^)(8)J^. With this, for any/ e 5^ (E, ^(J^)), 
the Weyl quantization 

defines a bounded operator on L^(R^, J^). The proof is virtually identical to that of 



Proposition 6.2.2 This also induces a Weyl product [t that is given by the same formula as 
before, 

which needs to be interpreted slightly differently though. This product has the same 
asymptotic expansion, 

fk=fg-ei{f,g} + 0(.e^l 

Here, f g is defined pointwise via the operator product on ^(J^), 

(fgXX) :=f(X)g(,X). 

This has some profound consequences: for instance, the Poisson bracket of an operator- 
valued function with itself does not vanish, 

d d 

{/,/} - ^^rf \n = E^^?/' ^^J^ 

1=1 1=1 

since the derivative of / with respect to ^ in general does not commute with the derivative 
of / with respect to x. Hence the Weyl commutator of / and g has non-trivial terms from 
the zeroth order on, 

U,g'\i=fh-gU = U,g'\-e-^{{f,g} - {gJ}) + 0{e^). 

This will be important in Chapter[83]where we discuss the semiclassical limit of multiscale 
systems. 



8.4 Effective quantum dynamics: adiabatic decoupling to all 
orders 

Now that we have introduced Weyl quantization for operator-valued functions, let us 
rewrite the defining properties of 
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and 

which are now assumed to be quantizations of the ^(J^f^st) -valued functions n and 

TTttTT = 71 + [H,, 7r]p = (8.11) 

Uttu* = 1 + U*ttu = 1 + uttTTttu* = Tlref + ^(^~). 

We will expand n ^ S^o^"^" " ^ S^o^""" asymptotically. We guess that the 
zeroth order terms are - as suggested by the notation - tzq and Uq as defined by equa- 
tions dsTT] ) and ( [8^ since 

uottu* = 1 + 0{8l u*K = 1 + 0{e) UoBtioK = 7i,ef + 0{e). 

follows immediately from the asymptotic expansion of []. In case of the first equation, for 
instance, we use that tZq^x) is a projection on J^f^g^ for each x G R^^ and obtain 

Our goal is to construct the effective hamiltonian 

^eff •= ^reiMHetl^l'^rei = Tr^ef uttH^Bu* TT^ef (8.12) 

since its quantization gives the effective dynamics of states associated to the relevant 
bands. Let us start with the construction of the tilted projection. 

8.4.1 Construction of the projection 

We can split each term tt^ of the projection into a (block) diagonal term 

:= UqUj^Uq + (1 - 7ro)7r„(l - Uq) 

and an offdiagonal term 

One can check easily that 7r„ = tt^ + n^^. Let us focus on the first-order correction Ui'. it 
turns out that the first equation, the projection defect 

ttoBtTo - no =: eG^ + ^ie^) = -^{no, 71^} + ^ie^l 

^For technical reasons, we must distinguish between Opg(n) and and similarly between £)pg(u) and U^. One 
can recover and from Opgin) and Op^iu), see [,Teu03i p. 82 f., p. 87 f.] for details. In computations, 
one need not distinguish between these objects. 



124 



8.4 Effective quantum dynamics: adiabatic decoupling to all orders 



only determines the diagonal term. To see that, let us substitute Uq + en^^ into the above 
equation and do the computation: 

(tto + ^OflC^io + sn^^) - {no + sn^^) = 

This means we could have substituted any offdiagonal 71°° into the equation, the term 
cancels out and its choice does not affect the projection defect G^. One can show that the 
diagonal part can be computed from Gj via 

n° = -fioG-^no + (1 - 7io)Gi(l - tIq)- 

The diagonal part needs to be taken into account when computing the commutation defect 

[H„7ro + e7r°]j=:eFi + ^(e2) 

= e[[H„ no] + [Ho, <] - ^{Ho, no] + ^{no,Ho}) + &is^) 

from which the offdiagonal part can be determined. Since by assumption 

0ie^) = [H„ no + sn^] „ = [H„ no + en°] „ + [H„ 571°°] „ 
= e(Fi + [Ho,d) + ^(e2) 

holds, we conclude that the offdiagonal part n^^ must satisfy 

[Ho, 71?^] =-F^. (8.13) 

In the presence of band crossings within the relevant bands, equation ( |8.13D cannot be 
solved in closed form and one has to resort to numerical schemes and an alternative ansatz 
IITeuOSi p. 79]. If there are no band crossings, the solution is given by 

Higher-order terms can be devised in a similar manner by recursion: assume we have 
computed the first n terms starting from a given Kq, 



k=0 
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Then we can get the diagonal part tt^^^ from the projection defect 
by the same equation as before, 

<+l = -^O^n+l^O + (1 - ^o)Gn+l(l - ^o). (8.14) 

The difficulty to solve 

[Ho,Ci] = -^^n+i (8.15) 

for the offdiagonal part tt^^^ explicitly remains where the right-hand side is the commu- 
tation defect 

8.4.2 Construction of the intertwining unitary 

The first order of the unitary is easy to construct once Ui has been computed. We make 
the ansatz that we write = (a^ + bi)uQ as the sum of a symmetric part = a* and an 
antisymmetric part bi = — b* which is akin to decomposing a complex number into real 
and imaginary part. Hence, this is not a restriction or a special assumption: any bounded 
operator A : J^ast — ^ ^ast can be written as 

C = |(C + C*) + |(C - C*) =: A + B. 

The symmetric part follows from the unitarity defect 

uottu* - 1 =: Ml + ^(s^) 

and is given by 

ai = -lA,. 

Equivalently, Uq^Uq — 1 could have been used to deduce a^. We have to plug in into the 
intertwining defect 

{uq + ^aiUo)B(^o + ^7ri)B(i/o + ^aiUo)* - n,^f =: eB^ + ^C^^) 

which can also be solved for hi explicitly, 

b,:=[n,,M. (8.16) 
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This recipe immediately generalizes to higher-order terms: if we have computed u up to 
errors of order ^(^""^^), 



fc=0 

we can solve for 

a„+i = -^A„+i (8.17) 

where is the unitarity defect 

The antisymmetric part is again related to the intertwining defect 
through the equation 

^+l = [^ref.5,+i]. (8.18) 

Put together, these give the n + 1th order term of the unitary, 2010.02.02 

= (a„+i + b„+i)uo. (8.19) 

8.4.3 The effective hamiltonian 

The zeroth order of the effective hamiltonian 

^effO = ^ref"o^o"o ^ref 

is obtained by simply replacing the Weyl product with the pointwise product and u as well 
as with the zeroth-order terms in their respective asymptotic expansions. To compute 
terms of higher orders, it is useful to compute the unitarily transformed hamiltonian first, 

h := uiHju\ (8.20) 

Then a closed form for hi can be deduced from expanding the left-hand side of 

u^H, - ho^u = shi^u + ^(s^) = sh^Uo + ^(s^) (8.21) 

to first order. The main simplification is that the left-hand side consists of single Weyl 
products instead of double Weyl products which are much more difficult to expand. This 
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relation is key to computing higher-order terms in h and /i^ff without making mistakes by 
leaving out terms. If h^^^ := 2fc=o^^^fc transformed hamiltonian where all terms 

up to nth order have been computed, then a closed expression for can be found by 
computing the left-hand side of 

Once we collect all terms of first order, can be expressed as 

= [uiu*, /lo] +U0H1U* + |({/io,Uo} - {uo,Ho})u*. (8.22) 

We emphasize that it helps tremendously to collect terms that are alike because there 
are typically a lot of non-obvious cancellations. The effective hamiltonian is obtained by 
conjugating with n^^^, 

^effl = ^ref^l^ref 

= 7rref[UiU*,/lo]7lref +7rrefUoHiU*7rref- |7lref({/lo.Wo} " {Uq, Ho})u* Tl^ef. (8.23) 

Let us go back to the original problem of the Born-Oppenheimer system where the rel- 
evant bands consist of an isolated, non-degenerate band, (J^q\{x) = {E^(x)}. Then the 
hamiltonian is the Weyl quantization of 

H,ix, 0=ie^ id^,„ + H,ix) =\e+ H,[x) = H^ix, 

and Tij-ef = I/) (/I for some x ^ ^fast ^i^h norm 1. The leading order term of the effective 
hamiltonian is exactly what we expect from equation ( |8.4| ), 

^effo(^. ?) = TTj-ef Uo(x)Ho(x, ?)u*(x) TT^ef 

= l/)(/l(l/)(^*WI+u,^(x))(|?2+He(x))(|(/;,(x))(;^ 

= C2^' + E,M)\x){xl (8.24) 

To compute the first-order correction, we need to work a little: we need to compute only 
two terms since UqHiU* = 0. The first one vanishes identically as well as we are interested 
in the case of a single band, 

^ref [^^l^O^^o] ^ref = ^ref "l^^S^O ^ref " ^ref ^O^l^o ^ref 



128 



8.4 Effective quantum dynamics: adiabatic decoupling to all orders 



Here, the usefulness of grouping terms properly shows. Similarly, the second term also 
simplifies since the Weyl product is bilinear and the Weyl product of two functions of 
position only reduces to the pointwise product. This means, we get 

d 

= i^ref{?^"oK^ref = Y^i^l \x){x\ \x){d.,^.\ \^.){x\ \x){x\ 
1=1 

= ?-i(V^(/?„(/?,)^^^^^ \x){xl 
Since (^^(x) has norm 1 for each x G , we can differentiate 

{if^ixlif^M)^^^^ = i 

with respect to x to conclude (V^(/?^, (/?^)^^^^^ = - ((/?^, V^(/?^)^^^^ . Thus, we can express 
heff 1 in terms of the Berry phase. 

Kin = ^7rref({^o."o} - {lio.^o})"o^ref = ' ^ ^rei- 

This term contains the geometric phase associated to the band E^. If we complete the 
square (which gives another error of order £:^), we see that acts like a "magnetic vector 
potential," 

^eff = ^effO + ^^effl + ^(^') = (5?' + Tl^ef " • ^ Tl^ef + ^(^') 
= (^(? - SJ^/f + E*(X)) Tlref + ^(s^l 

Associated to jz/, there is a pseudo-magnetic field H := djz/, the so-called Berry curvature. 
This field modifies quantum and classical dynamics alike if time-reversal symmetry of the 
system is broken by e. g. a magnetic field B or in the presence of band crossings. In 
our particular case, cr^^i^x) = {E^(x)} and B = 0, the pseudo-magnetic field D. = can be 
shown to vanish identically. 

8.4.4 Effective Dynamics 

The last - and most important - piece of the puzzle is to approximate the full dynamics 
generated by e~^^^' on J^phys for states in the relevant subspace f^s-^phys ^he dynamics 
generated by the effective hamiltonian h^^f := Opeih^ff) on J^siow ^ d^l . 

As a first step, let us show that it suffices to compute only finitely many terms of h^ff. 
assume we are interested to approximate the dynamics for times of the order €{}le) and 
we would like to make a total error of at most 0{s). We claim that it suffices to compute 
only hgffo and h^ffi- the usual trick, we rewrite the difference of the two evolution 
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groups as an integral and then reduce the difference between the evolution groups to the 
difference of their respective generators: 



Kif p-i(^-5X^effO+e^effl) 



io ^ 



(8.25) 



This means even for macroscopic times {€{}le) on the microscopic scale), the difference 
between e~'^^'''xlj and e'^^^^^ff^+^^^ff^V for any initial state i/; G J^^i^w ^ is small, 
i. e. of order ^(e). 

Now let us explain in what way the effective hamiltonian can be used to approximate 
the full dynamics: let us reconsider our favorite diagram ( |8.9D where we have defined 
h := Dpg(/i) and h^ff := Dpg(/ieff) to simplify notation. 



n 



phys 



n 

^slow ^ ^fast 



^ ^slow ^ 

U 



Now we show that the dynamics in the upper-left corner, the physical Hilbert space, by 
the dynamics in the upper-right space by using = Dpg(u) + 0(e^), 

and the usual Duhamel trick: 



-itH, 



Dp,(u)*e-^^^Dp,(u) + ^(E~) 



= J d5 — (e-^^^^ Dp,(u)*e-^^^-^%p,(u)) + ^(e°°) 
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Expanding the terms yields that the two evolution groups differ only by ^(s^) 

...= f ds(e-'^««(-i/?JDp,(u)*e-'(^-^%p,(u)+ 
Jo 

= f dse-^^^^iDp,(u*tt/ittu-HjDp,(u)*e-^^^-^%p,(u) + ^(^^ 
Jo 

= ^(s^) (8.26) 

Furthermore, if we start with a state in the relevant subspace, i. e. ft^J^hys^ then we can 
use the effective hamiltonian /i^ff to approximate the full dynamics: by equation ( |8.26| ) and 
n,=Dp,(7i) + ^(E°°), we get 

e-^^^n, = Dp,(ure-^^Dp,(u)Dp,(7r) + ^(e^ 

= Dp,(u)*e-^^n,efi)p,(u) + ^(^-) 

= Dp,(u)*e-^^^e^^ n,ef i)p,(u) + (8.27) 

Finally since we are interested in times of order 0{^le) and in a total precision of 0{s), 
we combine equation ( |8.25| ) with equation ( |8.27| ) to get the final result, 

e-^i^^ = Dp,(u)*e-^'^^ff Href £)p,(u) + ^(f°°) 

= Dp,(u)*e-^'^^^ffo+"^^ffi^ n,ef Op,(u) + 0{s) 

= Dp,(uo)*e-^K^effo+^/iefn) n,ef £)p,(uo) + ^(^) 

= u* e-^K^effo+^^effi) n^^f uo + ^(^). (8.28) 

This result tells us that if we start with states that are associated to the relevant bands, we 
can approximate the dynamics generated by the Born-Oppenheimer hamiltonian 

HBO=|P'+He(Q) 

with that of the first two terms of effective hamiltonian, i. e. the quantization of 2010.02.03 



8.5 Semiclassical limit 

As a last step in the analysis, we would like to make a semiclassical limit. We emphasize 
the semiclassical approximation is distinct from the adiabatic approximation. These two 
are very often mixed in other approaches. 
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Although there is a semiclassical theory for degenerate bands and several bands P Rob98[ 
ITeuOSIL for simplicity we consider the case where the relevant part of the spectrum con- 
sists of an isolated, non-degenerate hand, (J^q\{x) := {E^(x)}. We have shown in Chap- 
ter 8.4.3 that the effective hamiltonian is to leading order the quantization of 

Now we would like to make a semiclassical approximation for the dynamics of observ- 
ables. It turns out that this is only possible for observables that are compatible with the 
splitting into slow and fast degrees of freedom. 

Definition 8.5.1 (Macroscopic observable) A macroscopic observable e ^(^phys) is 
the quantization of a ^{^^^^^-valued function 

such that f commutes pointwise with the hamiltonian H^, i. e. 

[fix, an.ix, =fix, OH.ix, - H,ix, Ofix, O = G ^(^fest) 
holds for all (x, ^) e S. 

This implies / is compatible with the band structure of H^. With respect to the fast 
electronic degrees of freedom, it is a constant of motion. Even with respect to the slow 
nucleonic degrees of freedom, it is still an approximate constant of motion, 

[H„f]^=[H„f]+0ie) = 0ie). 

=0 

In a way this implies is oblivious to the details of the fast dynamics. Mathematically 
this key property is needed to ensure that the Weyl commutator which appears in the 
Egorov theorem (Theorem 7.1) is small. Associated to a macroscopic observable / (we 
will use / synonymously with its quantization F^) is its effective observable 

defined analogously to the effective hamiltonian (equation ( |8.12D ). This effective ob- 
servable describes the physics if we are interested in initial conditions associated to the 
relevant band in the representation fascilitated by = Dpg(u) + ^(e°°). The condition 
[Hg(x, ^),f(^x, (^)] = ensures that /^ff is compatible with this procedure, i. e. 

[n„Dp,(/)] = ^(E) 
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and 

[n,ef,£)p,(/eff)]=0 

hold. If we identify J^siow ^ ^i^h J^siow^ then according to the diagram, 



n 



n 

^slow ^ -^ast 



there is a clear interpretation of these objects: Dpg is the physical observable we seek 
to measure in experiments. This physical observable lives on the physical Hilbert space 
J^hys ^i^d its dynamics is governed by e~^^^'. The ejfective observable lives on the reference 
space J^siow ^i^d its dynamics is governed by the effective time evolution e~^^^^effo"^^^effi^ up 
to errors of order ^(e^). 

Let us compute the first few terms of the effective observable /^ffi the leading-order 
term is given by the partial average over the electronic state if^, 



(8.29) 



The first-order term also simplifies as we are interested in an isolated non-degenerate 
band: by equation ( |8.23D and arguments completely analogous to those in Chapter [8.4.3[ 
the subleading term computes to be 



/effl = ^rei [UiU * J] TT^ef +0 + ^ TT^ef ({/effQ. Up} " {Uq J }) U* Tl^ef 
=0 

where we have used 

= -J2/i. 



(8.30) 



=^fast 



=^fast 
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Example Two exceptionally important observables are our effective building block observ- 
ables 

Xgff := Tij-ef uttxttu*7ri.ef = X + 0{s^) (8.31) 

?eff '= Tlref Utt?ttw*7lref = ? " ^^(x) + €{e^). 

At least up to errors of order 0{e^) they look like the usual position observable and kinetic 
momentum associated to the "magnetic" field e^Lij := d^^^sjz/j) — d^,{ej2^i). Since there 
is a prefactor of s, this means, the "magnetic" field is weak. It is not surprising that this 
pseudomagnetic field eVt appears on the level of the classical equations of motion if we 
increase our precision to include terms of 0{e). 

Let us define the map : S — > 

T,:(x,?)-(x,?-e^(x)) + ^(e2) 

which replaces (x, ^) by the effective observables (x^ff, ^^ff) = (x, — ej?^(x)). This is 
in fact akin to adding a magnetic field to classical mechanics by minimal substitution, 
i. e. replacing momentum ^ with kinetic momentum ^ — A(x). We propose the following 

Lemma 8.5.2 Let f he a macroscopic observable. Then up to errors of order 0{e^\ /^ff is 
equal to the minimally substituted observable n^^^u^^f o rg)u* n^^^, 

feff(.x, ?) = (/ ° n)(x, ?) + 0(,e^) = / (x, ? - ej^/M) + 0(s'). (8.32) 

Proof We Taylor expand f oT^ around ^ = and get 

(/ O r,)(x, ?) = / (X + ^is^l ? - SJ^M + ^(8^)) 

= f{x, e) - 8V^f{x, ?) . ^(X) + €{8^1 

If we multiply the above expression with Tij-ef Uq from the left and u* Tij-gf from the right, it 
coincides with /^ffg + ^/effi as given by equations ( |8.29D and ( |8.3QD . □ 

This means, / and /eff are related by minimal substitution and averaging over the fast 
degrees of freedom. 

Theorem 8.5.3 (Semiclassical limit) Let f be a macroscopic observable. Then the full 
quantum dynamics of a Born-Oppenheimer system for an isolated non-degenerate band 
can be approximated by the flow generated by 

-id oim iv. 



(k' + £*(x)) (8.33) 



in the follovi^ing sense: 

= 0(e^) (8.34) 



■ qmv 
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Proof First of all, from Chapter 8.4.4 (equation ( |8.28| )), we know we can replace the full 
dynamics e'^^ftg by 

This can be plugged into 

^ V ' 

=/effO + ^(^) 

Now we make the semiclassical approximation: since we can regard Opg^h^ffQ + ^/leffi) 
and Dpg(/effo) as operators on ^^i^^^C^ = J^siow = I^^CM^^), we can invoke Theorem 7.1, 
the Egorov-type theorem: by the usual trick, we relate quantum and classical dynamics: 
first of all, since (x^ff, ^^ff) = (x, ^) + ^(e^), we can replace the effective variables by 
usual variables in the classical equations of motion, equation ( |8.33D . We also see that the 
hamiltonian in these equations of motion is nothing but /i^ffoCx, |<?^ + E^(x). 

Now we can make a semiclassical argument to replace the full quantum mechanics 
generated by h^^^Q + ^/igff i and compare it to the quantization of the classically evolved 
observable, 

rt , 

= [ dse+'t(^-,„+^^»'")Dp,(i[/leffo + efteffl,/effo(t-s)]„ + 

Jo 

To show that the integral is small, we need to show that 

~ [KffO + ^^effl^/effo(f ~ "5)] H ~ {^effO^/effo(f ~ " 

= ; [^effO + ^^effl Jeffo(f " s)] + ^(^) 

is of order 0(e). In the single-band case, /^effo + ^^effi /effo scaZar-valued functions 
and hence the pointwise commutator vanishes, 

[/leffo + ^/leff 1 Jeffo(t " s)] = 0. 
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Thus, we have shown that equation ( |8.34| ) holds. 

This concludes the proof. □ 

The proof already contains the core idea how to push the error to €{e^y. one has to 
compute /ieff2 and then include the pseudomagnetic field sVL in the semiclassical equations 
of motion. However, since the pseudomagnetic field is weak, on our time scale and with 
our level of precision, it will not affect the dynamics. 
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